
Title: Have you considered supporting regular expressions in the ad filter?

Author: cntime    Time: 2006-5-15 17:18     Title: Have you considered supporting regular expressions in the ad filter?

As the title says. With regex support, every rule in this list would become usable.
The list is here: http://nb.21windows.cn/blog/?action=show&id=150
Author: cntime    Time: 2006-5-15 17:19

I'll repost the rules here as well, to save everyone some time.
*.4se*link.com/hitsin.asp?link=**
*.ad*smostvisited.com*
*.ad.*
*.ad4all.*
*.adbull.de*
*.ads.*
*.adverline.co*/clic?id=**
*.ad-*
*.allyes.com/*
*.a-counter.com*
*.baren*de.com/tts-cgi/tts-in.cgi?*
*.bfast.com*
*.carpediem.fr/*.exe*
*.celebrity*ut.net*.exe*
*.cinema.de/forwarder/*
*.com/?ref=**
*.com/?revid=**
*.com/admanager/*id=**
*.com/advertising/redirect.asp?*url=*http://*
*.com/affiliates/a.asp?*
*.com/bin/xpromo.x/*
*.com/bjump.php?id=**
*.com/cgi-bin/click.cgi?*
*.com/cgi-bin/clk.cgi?*
*.com/cgi-bin/in?*
*.com/cgi-bin/refer.cgi?*
*.com/cgi-bin/topvlog.cgi?*
*.com/cgi-bin/tracker.cgi?*
*.com/click.php?*
*.com/click?id=**&site=**
*.com/clicks.asp?*id=**
*.com/default.asp?ref=**
*.com/default.asp?url=*http*
*.com/default.html?did=**
*.com/default.php?affid=**
*.com/enter.html?id=**
*.com/gate.asp?account=**
*.com/home.html?*id=**id=**
*.com/in.pl?id=**
*.com/index.asp?associateid=**
*.com/index.html?a=**
*.com/index.php?id=**
*.com/index.php?r=**
*.com/index.php?tl_id=**
*.com/insertjump.php?id=**
*.com/join/index.html?*
*.com/php/redirect.php/*
*.com/pps=**
*.com/ps/ct/count.cgi?a=**
*.com/pt=**
*.com/r/partners/*
*.com/redir?*advid=**
*.com/signup/?ref=**
*.com/site_in.php?id=**
*.com/specialtrial/?wm_login=**
*.com/toplist.asp*id=**
*.com/tracker.cgi?*
*.com/tracksponsor.aspx?*id=**
*.de/impressum.htm?verl_id=**
*.eads.com*?*
*.elitecash.*
*.exitfuel.*?*
*.fastclick.*
*.flycast.*
*.getmor*men.com/cgi-bin/a.pl?*
*.hitbox.*
*.hornyheidi.com*
*.hotlog.ru*
*.hypercount.*
*.iclicks.de*
*.igallery.*
*.link4ads.*
*.linkexchange.*
*.linux.org/perl-bin/invoke?obj=**
*.logging.to/log.php?account=**&url=*http://*
*.maximumcash.*=**
*.nedstatbasic.*
*.net/rating/in.php?*
*.payserve.*/cash.cgi?*
*.pcavs.com/getid.phtml?k=**
*.pus*list.com/cgi-bin/top/account.cgi?*
*.qksrv.net*
*.respond.com/*/sellers.jsp?src=**
*.screensaver.com/advantage/cthrough.asp?*
*.se*swap2.com*
*.speedyclick.*=**
*.spylog.*
*.stats4all.*
*.thecounter.com*
*.topping.com.ua/*
*.uk/redirect.srf?id=**
*.ultravideos.com*
*.valueclick.*?*
*.virtual*xmall.com/partners/*/index.html*
*.webcounter.goweb.*
*.webtrendslive.*?*
*.zanox-affiliate.de/bin/z_ct_trc.dll?*
*/*click*&adid=**
*/.sbean?bean=**
*//access.adultterra.com/*
*//action.ientry.net/?rc=**
*//ad*.bannerbank.*
*//ad*/redir.cgi?*
*//ad.*?*
*//adlik.*?id=**
*//ads.*
*//adserver.*=**
*//adv.*/ad_page*
*//ar.atwola.com*
*//banner.kiev.ua*
*//bannerads.*
*//cgi*.fxweb.com/v*-*.cgi*
*//click*.oxcash.com/?*=**=**
*//click.*/http://*
*//click.*id=**
*//click.atdmt.*
*//click.silvercash.com/?*
*//clickcash.webpower.com/*=**=**=**
*//clicks.*?*
*//clix.superclix.de*
*//counter*.bravenet.com*
*//counter.*
*//freeporn.adultterra.com*
*//go.*.com/a=**/s=**
*//in.*.com/?s=**&a=**
*//link.siccash.com/cgi-bin/*raw*
*//linktrack.*?id=**
*//newads.*
*//partners.hotgold.com/cgi-bin/*?id=**
*//paymail*/mailally.jsp?id=**
*//rb.rfn.ru/cgi-bin/href*
*//rd.yahoo.com/*//shop.store.yahoo.com*
*//showcount.*?site=**
*//top100.*
*//topsites.*/in.*
*//tracker.tradedoubler.com*
*/007movie*.exe*
*/?revid=**&s=**
*/?tslid=**
*/ad-*
*/ad.*?id=**
*/ad/*
*/ad/admentorredir.asp*
*/ad_click.*
*/ad_target.asp?*
*/adclick*
*/adentry.asp?*
*/adlog.pl?*
*/adman?*id=**id=**
*/adrotate/click*?*
*/ads.pl?*
*/ads?*
*/adsystem/redirect.asp?url=**
*/advert/ads.cgi?*
*/adx_click.html?*
*/affclick.cgi?affid=**
*/affiliates/clickthru.cgi/*
*/aftrack.asp?id=**
*/ap/click?*
*/banmanpro/banman.*
*/banner.asp?*
*/banredir.cgi?*id=**
*/bin/red.x/*bidpay.com/sellerregistrationstep1.asp*
*/bonus.cfm?id=**
*/bpwork.cgi?advert=**
*/bumblecash.cgi?*
*/buyjump.html*
*/cash4adverts.pl?sponsor=**
*/casinolux.exe*
*/category.cfm?id=**&aff=**
*/cgi-bin/ads/*?*
*/cgi-bin/bc.m?count=**&link=*http://*
*/cgi-bin/ccshare.pl*id*
*/cgi-bin/click-*.cgi?*
*/cgi-bin/intellilink/in.cgi?id=**
*/cgi-bin/redirect.cgi/*campaign=**
*/cgi-bin/refer.php?user=**
*/click.cgi?file=**@*
*/click.php*id=**
*/click.php*ref=**
*/click/?*=**
*/click?account=**site=**
*/clickad.jsp?*
*/clickbanner.*
*/clickthru.acc?*
*/count_in.asp?id=**
*/countad2.php?*=**
*/ctc.cgi?*
*/default.asp?vid=**
*/event.ng/type=**
*/getafid.asp*affid=**
*/go.*sponsor_id=**
*/guanggao/js/*
*/hitin.php?id=**
*/holly_celebs.exe*
*/in.php?id=**
*/in.php?thesiteno=**
*/index.*?tid=**
*/j4free/go.php*
*/js/ad/*
*/js/float*.js*
*/jump.zil*outurl=**
*/jws-track.cgi?id=**site=**
*/largecash/*/index.html*
*/lives*x.exe*
*/lspro/lspro.cgi?click=**
*/main.cgi?refererid=**
*/main.htm?id=**
*/maxcash.cgi?*=**
*/nastydollars.htm*
*/oasys/oasisc.php?s=**&c=**
*/out.cgi?smallbanner*
*/pagead/*
*/popup.*
*/r.kds?r=**
*/rankem.cgi*id=**
*/realmedia/ads/*
*/redirect.asp?url=**?pa_name=**
*/redirect.php?adcomp=**&adurl=**
*/refer?*id=**
*/reklama.*
*/revshare/agents.cgi*refer*
*/rs.exe?revshareid=**
*/s/advtrace/*
*/s/banmat*/banmat*.cgi?url=**
*/servlet/ajrotator/*
*/snfrlinkout.php?id=**
*/subscribe.php?site_id=**&src=**
*/tong.php?op=**&h_id=**
*/top/?id=**
*/top100/*
*/toplist.php3?action=*in&login=**
*/topref.cgi?*
*/topsites.cgi*
*/topsites/in.cgi?id=**
*/topsites/sitelist.asp?id=**
*/track.cfm?bannerid=**
*/track/*/redirect.cgi?*
*ads99.net/*
/(\.|\/)(ad|banner|c|sm)(s)?(\d)*(\.|\/|_)/
/(\.|\/)(allyes|pfp|scalink|adssi|\wgg)(s)?(\d)*(\.|\/|_)/
/(\.|\/)(banner|benner|ad|(a\_d(\_)?(s)?)|adv|allyes|adjs|guangao|guanggao|adbot|advert|adclient|adcouncil|adgif|adgraph|adimage|adinfo|adlog|adpic|adrotator|advert|adview|softad|advertisement)(s)?(\d)*(\.|\/|_)/
/(\/)[0-9]{4}\/[0-9]{1,2}\/[0-9]{1,2}(\/).(\.)swf/
/(\/)\w(\.js)/
/(hot|spy)log/
/[\W\d](double|fast)click[\W\d]/
/[\W\d]click(stream|thrutraffic|thru|xchange)[\W\d]/
/[\W\d]dime(xchange|click)[\W\d]/
/[\W\d]value(stream|xchange|click)[\W\d]/
/[\W\d_](top|bottom|left|right)?banner(s|id=|\d|_)[\W\d]/
/[\W_](b(an|nr)s?|jump|redir(ect|s)?|stat)[\W_]/
/[\W_](onlineads?|ad(banner|click|-?flow|frame|ima?g(es?)?|_id|js|log|serv(er|e)?|stream|_string|s|trix|type|vertisements?|v|vert|xchange)?)[\W\d]/
/\/buy_assets\//
/\/img(s)?\/(banner|benner)(\w)+\.(swf|gif|jpg)/
/\D(588|468|234|120)x(600?|120|90)\D/
/\W(cy|r)?c(ou)?nt(er|ed)?\W/
/\wgg/
/p(artner|ing\.cgi|romotion)/
/skycn\.(net|com)\/(tuijian|tuijianimg|js)/
/sp(onsor|ymagic)/
/top(100|cto)/
googlesyndication
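
For anyone who wants to experiment with the list, here is a minimal Python sketch of how such a mixed rule file might be interpreted. It assumes that a line wrapped in `/.../` is a regular expression, that anything else is a wildcard rule in which any run of `*` matches arbitrary text, and that matching is case-insensitive and may start anywhere in the URL. Those semantics are guesses, not the documented behaviour of any browser, and `compile_rule` and `is_blocked` are hypothetical names.

```python
import re

def compile_rule(rule: str) -> re.Pattern:
    """Compile one rule line (assumed semantics, see the note above)."""
    # Assumption: /.../ marks a regular expression.
    if len(rule) >= 2 and rule.startswith("/") and rule.endswith("/"):
        return re.compile(rule[1:-1], re.IGNORECASE)
    # Otherwise treat it as a wildcard: escape the literal pieces and let
    # every run of '*' match any text (so '**' behaves like '*').
    pieces = [re.escape(piece) for piece in re.split(r"\*+", rule)]
    return re.compile(".*".join(pieces), re.IGNORECASE)

def is_blocked(url: str, rules: list) -> bool:
    """Return True if any compiled rule matches somewhere in the URL."""
    return any(rule.search(url) for rule in rules)

# A few rules copied from the list above.
rules = [compile_rule(r) for r in [
    "*.fastclick.*",
    "*/cgi-bin/click-*.cgi?*",
    "/(hot|spy)log/",
    "googlesyndication",
]]

print(is_blocked("http://media.fastclick.net/w/get.media", rules))
print(is_blocked("http://example.com/index.html", rules))
```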
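Author: zlowly    Time: 2006-5-15 18:42

Most of these rules will never actually be hit, and many of them have not been optimized or merged, so the list is terribly inefficient.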
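
One possible reading of "merging" is to combine several regex rules into a single alternation, so that each URL is scanned once instead of once per rule. The Python sketch below does that for three rules from the list. Whether it is actually faster depends on the regex engine, so treat it as an illustration rather than a measured optimization. `merged` and `is_blocked` are hypothetical names.

```python
import re

# Three of the regex rules from the list above, without their /.../ delimiters.
sources = [r"(hot|spy)log", r"sp(onsor|ymagic)", r"top(100|cto)"]

# Wrap each rule in a non-capturing group and join with '|' so that a single
# search tests all of them.
merged = re.compile("|".join(f"(?:{s})" for s in sources), re.IGNORECASE)

def is_blocked(url: str) -> bool:
    """Return True if any of the merged rules matches somewhere in the URL."""
    return merged.search(url) is not None

print(is_blocked("http://example.com/top100/in.cgi"))
print(is_blocked("http://example.com/"))
```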
Author: caten    Time: 2006-5-15 18:54

Regex support is very important. This was brought up long ago~~~~ and should be the consensus by now~
Author: mt0003    Time: 2006-5-15 19:01

I also hope TheWorld can become even better at ad blocking.
Author: Kinkairy    Time: 2006-5-15 19:57

MT2.0 already supports it, and I trust TW will not fall behind. After all, this is becoming something of a trend.
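Author: AY    Time: 2006-5-15 22:24

If you are only matching URLs, wildcards are entirely sufficient. If you match against the page source, as m2 does, regex support is indispensable.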
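
To illustrate the page-source case, here is a small Python sketch of a content rule that needs more than wildcards: it removes `<script>` elements whose `src` contains an `/ad/` or `/guanggao/` path segment, while leaving other scripts alone. This is only an assumption about what "matching page source" could look like, not a description of how m2 implements it. `AD_SCRIPT` and `strip_ad_scripts` are hypothetical names.

```python
import re

# Hypothetical content rule: a <script> tag whose src attribute contains an
# /ad/ or /guanggao/ path segment, up to its closing tag. The [^>] and [^"]
# classes keep the match inside one tag and one attribute value, which a
# plain '*' wildcard cannot express.
AD_SCRIPT = re.compile(
    r'<script\b[^>]*\bsrc="[^"]*/(?:ad|guanggao)/[^"]*"[^>]*>.*?</script>',
    re.IGNORECASE | re.DOTALL,
)

def strip_ad_scripts(html: str) -> str:
    """Return the page source with matching <script> elements removed."""
    return AD_SCRIPT.sub("", html)

page = (
    '<p>hi</p>'
    '<script src="/js/ad/float.js"></script>'
    '<script src="/js/app.js"></script>'
)
print(strip_ad_scripts(page))
```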
Author: Alandelong    Time: 2006-5-15 22:52

Ever since I tried MT2.0, I have also felt that this feature is important.




Welcome to the TheWorld Forum (世界之窗论坛) (http://bbs.theworld.cn./) Powered by Discuz! 7.2