# Robots.txt file from http://www.searchengineworld.com # # Built from text file http://info.webcrawler.com/mak/projects/robots/active/all.txt # # This restricts access to only known and registered robots. # User-agent: Googlebot Disallow: / User-agent: Googlebot/2.1 Disallow: / User-agent: SurveyBot/2.3 Disallow: / User-agent: Mozilla/3.0 (compatible;miner;mailto:miner@miner.com.br) Disallow: User-agent: WebFerret Disallow: User-agent: Due to a deficiency in Java it's not currently possible to set the User-agent. Disallow: User-agent: Slurp/2.0 Disallow: / User-agent: ESISmartSpider/2.0 Disallow: User-agent: Snooper/b97_01 Disallow: User-agent: Solbot/1.0 LWP/5.07 Disallow: User-agent: Spanner/1.0 (Linux 2.0.27 i586) Disallow: User-agent: Mozilla/3.0 (Black Widow v1.1.0; Linux 2.0.27; Dec 31 1997 12:25:00 Disallow: User-agent: Tarantula/1.0 Disallow: User-agent: tarspider Disallow: User-agent: dlw3robot/x.y (in TclX by http://hplyot.obspm.fr/~dl/) Disallow: User-agent: Templeton/ Disallow: User-agent: TitIn/0.2 Disallow: User-agent: TITAN/0.1 Disallow: User-agent: UCSD-Crawler Disallow: User-agent: urlck/1.2.3 Disallow: User-agent: Valkyrie/1.0 libwww-perl/0.40 Disallow: User-agent: Victoria/1.0 Disallow: User-agent: vision-search/3.0' Disallow: User-agent: VWbot_K/4.2 Disallow: User-agent: W3M2/x.xxx Disallow: User-agent: WWWWanderer v3.0 Disallow: User-agent: WebCopy/ Disallow: User-agent: WebCrawler/3.0 Robot libwww/5.0a Disallow: User-agent: WebFetcher/0.8, Disallow: User-agent: weblayers/0.0 Disallow: User-agent: WebLinker/0.0 libwww-perl/0.1 Disallow: User-agent: WebMoose/0.0.0000 Disallow: User-agent: Digimarc WebReader/1.2 Disallow: User-agent: webs@recruit.co.jp Disallow: User-agent: webvac/1.0 Disallow: User-agent: webwalk Disallow: User-agent: WebWalker/1.10 Disallow: User-agent: WebWatch Disallow: User-agent: Wget/1.4.0 Disallow: User-agent: w3mir Disallow: User-agent: XGET/0.7 Disallow: User-agent: Nederland.zoek Disallow: User-agent: BizBot04 kirk.overleaf.com Disallow: User-agent: HappyBot (gserver.kw.net) Disallow: User-agent: CaliforniaBrownSpider Disallow: User-agent: EI*Net/0.1 libwww/0.1 Disallow: User-agent: Ibot/1.0 libwww-perl/0.40 Disallow: User-agent: Merritt/1.0 Disallow: User-agent: StatFetcher/1.0 Disallow: User-agent: TeacherSoft/1.0 libwww/2.17 Disallow: User-agent: WWW Collector Disallow: User-agent: processor/0.0ALPHA libwww-perl/0.20 Disallow: User-agent: wobot/1.0 from 206.214.202.45 Disallow: User-agent: Libertech-Rover www.libertech.com? Disallow: User-agent: WhoWhere Robot Disallow: User-agent: ITI Spider Disallow: User-agent: w3index Disallow: User-agent: MyCNNSpider Disallow: User-agent: SummyCrawler Disallow: User-agent: OGspider Disallow: User-agent: linklooker Disallow: User-agent: CyberSpyder (amant@www.cyberspyder.com) Disallow: User-agent: SlowBot Disallow: User-agent: heraSpider Disallow: User-agent: Surfbot Disallow: User-agent: Bizbot003 Disallow: User-agent: WebWalker Disallow: User-agent: SandBot Disallow: User-agent: EnigmaBot Disallow: User-agent: spyder3.microsys.com Disallow: User-agent: www.freeloader.com. Disallow: User-agent: METAGOPHER Disallow: