Block website archivers and email spam collectors
- Bigwebmaster
- Site Admin


- Joined: Dec 20, 2002
- Posts: 8924
- Loc: Seattle, WA & Phoenix, AZ
- Status: Offline
Here is something simple you can do to protect against website archiving programs and email collector programs. One reason to block these malicious programs is that they hog your server's resources: most will request as many pages per second as their computer can handle, which can seriously bog down or even crash your server. To use this, your host must let you use the Limit directive. To block some of the common programs, place this in your .htaccess file:
SetEnvIfNoCase User-Agent "HTTrack" bad_bot
SetEnvIfNoCase User-Agent "Download Ninja 2.0" bad_bot
SetEnvIfNoCase User-Agent "JBH Agent 2.0" bad_bot
SetEnvIfNoCase User-Agent "EmailCollector/1.0" spam_bot
SetEnvIfNoCase User-Agent "EmailSiphon" spam_bot
SetEnvIfNoCase User-Agent "EmailWolf 1.00" spam_bot
SetEnvIfNoCase User-Agent "ExtractorPro" spam_bot
SetEnvIfNoCase User-Agent "Crescent Internet ToolPak HTTP OLE Control v.1.0" spam_bot
SetEnvIfNoCase User-Agent "Mozilla/2\.0 \(compatible; NEWT ActiveX; Win32\)" spam_bot
SetEnvIfNoCase User-Agent "CherryPicker/1.0" spam_bot
SetEnvIfNoCase User-Agent "CherryPickerSE/1.0" spam_bot
SetEnvIfNoCase User-Agent "CherryPickerElite/1.0" spam_bot
SetEnvIfNoCase User-Agent "NICErsPRO" spam_bot
SetEnvIfNoCase User-Agent "WebBandit/2.1" spam_bot
SetEnvIfNoCase User-Agent "WebBandit/3.50" spam_bot
SetEnvIfNoCase User-Agent "webbandit/4.00.0" spam_bot
SetEnvIfNoCase User-Agent "WebEMailExtractor/1.0B" spam_bot
SetEnvIfNoCase User-Agent "autoemailspider" spam_bot
<Limit GET POST HEAD>
Order Allow,Deny
Allow from all
Deny from env=bad_bot
Deny from env=spam_bot
</Limit>
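On Apache 2.4 and later, Order/Allow/Deny are only available through the mod_access_compat module, and the same idea is usually written with Require instead. A rough sketch, assuming mod_authz_core is loaded and the SetEnvIfNoCase lines above are kept as they are (leaving out the Limit wrapper is my own choice here, so the rule covers every request method):

```apache
# Assumes bad_bot / spam_bot were set by the SetEnvIfNoCase lines above.
<RequireAll>
    # Let everyone in by default...
    Require all granted
    # ...except requests flagged by one of the User-Agent matches.
    Require not env bad_bot
    Require not env spam_bot
</RequireAll>
```

The negated Require lines only work inside a RequireAll block, which is why the whole thing is wrapped.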
If anybody knows of any more site archivers or email collectors to block, please post them here.
November 9th, 2003, 5:36 pm
- joebert
- Sledgehammer


- Joined: Feb 10, 2004
- Posts: 13455
- Loc: Florida
- Status: Offline
Might be interesting to see if any of these bots are still around, & what new ones have surfaced in the last few years. 
Here's a list of additions I'd come across the other day.
SetEnvIfNoCase user-agent "^BlackWidow" bad_bot
SetEnvIfNoCase user-agent "^Bot\ mailto:craftbot@yahoo.com" bad_bot
SetEnvIfNoCase user-agent "^ChinaClaw" bad_bot
SetEnvIfNoCase user-agent "^Custo" bad_bot
SetEnvIfNoCase user-agent "^DISCo" bad_bot
SetEnvIfNoCase user-agent "^Download\ Demon" bad_bot
SetEnvIfNoCase user-agent "^eCatch" bad_bot
SetEnvIfNoCase user-agent "^EirGrabber" bad_bot
SetEnvIfNoCase user-agent "^EmailSiphon" bad_bot
SetEnvIfNoCase user-agent "^EmailWolf" bad_bot
SetEnvIfNoCase user-agent "^Express\ WebPictures" bad_bot
SetEnvIfNoCase user-agent "^ExtractorPro" bad_bot
SetEnvIfNoCase user-agent "^EyeNetIE" bad_bot
SetEnvIfNoCase user-agent "^FlashGet" bad_bot
SetEnvIfNoCase user-agent "^GetRight" bad_bot
SetEnvIfNoCase user-agent "^GetWeb!" bad_bot
SetEnvIfNoCase user-agent "^Go!Zilla" bad_bot
SetEnvIfNoCase user-agent "^Go-Ahead-Got-It" bad_bot
SetEnvIfNoCase user-agent "^GrabNet" bad_bot
SetEnvIfNoCase user-agent "^Grafula" bad_bot
SetEnvIfNoCase user-agent "^HMView" bad_bot
SetEnvIfNoCase user-agent "HTTrack" bad_bot
SetEnvIfNoCase user-agent "^Image\ Stripper" bad_bot
SetEnvIfNoCase user-agent "^Image\ Sucker" bad_bot
SetEnvIfNoCase user-agent "Indy\ Library" bad_bot
SetEnvIfNoCase user-agent "^InterGET" bad_bot
SetEnvIfNoCase user-agent "^Internet\ Ninja" bad_bot
SetEnvIfNoCase user-agent "^JetCar" bad_bot
SetEnvIfNoCase user-agent "^JOC\ Web\ Spider" bad_bot
SetEnvIfNoCase user-agent "^larbin" bad_bot
SetEnvIfNoCase user-agent "^LeechFTP" bad_bot
SetEnvIfNoCase user-agent "^Mass\ Downloader" bad_bot
SetEnvIfNoCase user-agent "^MIDown\ tool" bad_bot
SetEnvIfNoCase user-agent "^Mister\ PiX" bad_bot
SetEnvIfNoCase user-agent "^Navroad" bad_bot
SetEnvIfNoCase user-agent "^NearSite" bad_bot
SetEnvIfNoCase user-agent "^NetAnts" bad_bot
SetEnvIfNoCase user-agent "^NetSpider" bad_bot
SetEnvIfNoCase user-agent "^Net\ Vampire" bad_bot
SetEnvIfNoCase user-agent "^NetZIP" bad_bot
SetEnvIfNoCase user-agent "^Octopus" bad_bot
SetEnvIfNoCase user-agent "^Offline\ Explorer" bad_bot
SetEnvIfNoCase user-agent "^Offline\ Navigator" bad_bot
SetEnvIfNoCase user-agent "^PageGrabber" bad_bot
SetEnvIfNoCase user-agent "^Papa\ Foto" bad_bot
SetEnvIfNoCase user-agent "^pavuk" bad_bot
SetEnvIfNoCase user-agent "^pcBrowser" bad_bot
SetEnvIfNoCase user-agent "^RealDownload" bad_bot
SetEnvIfNoCase user-agent "^ReGet" bad_bot
SetEnvIfNoCase user-agent "^SiteSnagger" bad_bot
SetEnvIfNoCase user-agent "^SmartDownload" bad_bot
SetEnvIfNoCase user-agent "^SuperBot" bad_bot
SetEnvIfNoCase user-agent "^SuperHTTP" bad_bot
SetEnvIfNoCase user-agent "^Surfbot" bad_bot
SetEnvIfNoCase user-agent "^tAkeOut" bad_bot
SetEnvIfNoCase user-agent "^Teleport\ Pro" bad_bot
SetEnvIfNoCase user-agent "^VoidEYE" bad_bot
SetEnvIfNoCase user-agent "^Web\ Image\ Collector" bad_bot
SetEnvIfNoCase user-agent "^Web\ Sucker" bad_bot
SetEnvIfNoCase user-agent "^WebAuto" bad_bot
SetEnvIfNoCase user-agent "^WebCopier" bad_bot
SetEnvIfNoCase user-agent "^WebFetch" bad_bot
SetEnvIfNoCase user-agent "^WebGo\ IS" bad_bot
SetEnvIfNoCase user-agent "^WebLeacher" bad_bot
SetEnvIfNoCase user-agent "^WebReaper" bad_bot
SetEnvIfNoCase user-agent "^WebSauger" bad_bot
SetEnvIfNoCase user-agent "^Website\ eXtractor" bad_bot
SetEnvIfNoCase user-agent "^Website\ Quester" bad_bot
SetEnvIfNoCase user-agent "^WebStripper" bad_bot
SetEnvIfNoCase user-agent "^WebWhacker" bad_bot
SetEnvIfNoCase user-agent "^WebZIP" bad_bot
SetEnvIfNoCase user-agent "^Widow" bad_bot
SetEnvIfNoCase user-agent "^WWWOFFLE" bad_bot
SetEnvIfNoCase user-agent "^Xaldon\ WebSpider" bad_bot
SetEnvIfNoCase user-agent "^Zeus" bad_bot
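Since these additions only set the bad_bot variable, they still need the deny rules from the first post before they block anything. A minimal sketch, assuming the same Order/Allow/Deny directives used earlier in the thread; the grouped pattern is just an illustrative shorthand for three of the entries, not a replacement for the full list:

```apache
# Illustrative: several anchored names folded into one regex alternation.
SetEnvIfNoCase User-Agent "^(BlackWidow|ChinaClaw|WebZIP)" bad_bot
# Unanchored: matches HTTrack anywhere in the User-Agent string.
SetEnvIfNoCase User-Agent "HTTrack" bad_bot

<Limit GET POST HEAD>
Order Allow,Deny
Allow from all
Deny from env=bad_bot
</Limit>
```

The leading ^ means the name has to appear at the very start of the User-Agent string, so entries without it (like HTTrack) catch agents that bury their name inside a longer string.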

