Linux SoftwareInternetHTTP (WWW)bot-trap 0.92

bot-trap 0.92


bot-trap allows your Web site to automatically ban bad Web robots (a.k.a
Developer:   Daniel M. Webb
      more software by author →
Price:  0.00
License:   Public Domain
File size:   6K
Language:   
OS:   
Rating:   0 /5 (0 votes)
Your vote:  
enlarge screenshot


bot-trap allows your Web site to automatically ban bad Web robots (a.k.a. Web spiders) that ignore the robots.txt file.

This does not include Googlebot and other well-behaved robots.

The main advantage over other implementations of this concept is that bot-trap has a manual "unban" feature so that humans can unban, but robots can't.

How It Works:

- You place a small "web-bug" strategically in your web pages. This bug is just a tiny image link that says to go to /bot-trap/index.php. Normal people don't see this link, but web bots do.

- You create a /robots.txt file that tells web bots not to go to the /bot-trap directory.
When the bad robot visits /bot-trap/index.php anyway, /bot-trap/index.php adds the IP address of the bad bot to a block list in /.htaccess. They are blocked from access to the site from then on. You can also be emailed when this happens.

Safeguards

It is possible that someone is banned who shouldn't be. Perhaps a previous user of an IP address in a DHCP pool was a naughty user and ran a bad bot, but now the new user is banned. Not to worry, the custom "403 Forbidden" page allows any user to unban themselves by typing a requested word into a form box. Real people can easily do this, but bots can't!

Installation:

1. Unpack the tarball in your web page root directory:
# tar -xzf bot-trap-x.x.tar.gz
2. Either add a line to your root .htaccess file like:
ErrorDocument 403 /bot-trap/forbid.php
or copy the premade one (bot-trap/htaccess-root-example). Notice that since once an IP is banned, it can't access anything in /, so the 403 page should be in /bot-trap, and /bot-trap/.htaccess should only say "Allow from all". Look at the forbid.php file in the distribution to see how to do this, or just use it as-is.
3. Make sure .htaccess controls are allowed in your Apache configuration (especially the "AllowOverride" directive). This allows bot-trap to ban IP addresses using the htaccess mechanism.
4. Create the empty file blacklist.dat in your web root directory, and make blacklist.dat, .htaccess, and the bot-trap directory in your web root directory owned by the www user with write permission. If web server uses a group (like the group "www-data" on Debian GNU/Linux), set these files and directories group-writable.
5. Edit bot-trap/settings.php to hold the correct email addresses to send alerts to.
6. Add "web-bugs" to your main web page to catch the bad bots. This is the XHTML code:
< !-- Bad robot trap: Don't go here or your IP will be banned! -->
< a href="/bot-trap/">< img src="bot-trap/pixel.gif" border="0"
alt=" " width="1" height="1"/>< /a>

7. Add the bot-trap directory to your robots.txt file, or copy the example robots.txt file (bot-trap/robots.txt.example) to the root directory.
8. Make sure /.htaccess and all other files have the correct permissions and ownership for your site.
tags bot trap  robots txt  your web  root directory  txt file  the bad  trap directory  the bot  trap index  index php  blacklist dat  sure htaccess  the correct  

Download bot-trap 0.92


 http://danielwebb.us/software/bot-trap/bot-trap_0.9.2.tar.gz


Authors software

CPAR 0.9.1 (by Daniel M. Webb)
CPAR is a C library that allows complex parameter passing such as found in interpreted languages.

This allows passing named parame

bot-trap 0.92 (by Daniel M. Webb)
bot-trap allows your Web site to automatically ban bad Web robots (a.k.a


Similar software

bot-trap 0.92 (by Daniel M. Webb)
bot-trap allows your Web site to automatically ban bad Web robots (a.k.a

Template::Tutorial 2.15 (by Andy Wardley)
Template::Tutorial are template toolkit tutorials.

This section includes tutorials on using the Template Toolkit

Jawa Open Eyes 5.1 (by Haris Hasanudin)

MathML::Entities 0.13 (by Jacques Distler)
MathML::Entities is a Perl module that can convert XHTML+MathML Named Entities to Numeric Character References.

SYNOPSIS

use M

LooperNG 1.3 (by Mohit Muthanna)
LooperNG is an intelligent event routing daemon

SNMP Trap Translator 1.1 (by Alex Burger)
SNMP Trap Translator is an SNMP trap handler written in Perl for use with the NET-SNMP and UCD-SNMP snmptrapd program

robots.txt 2.0 (by Andrew Smith)
robots.txt is a script in PHP that acts like a normal robots.txt file, but with a few differences.

When a Spider attempts to access

Clement 2.1-202 (by Jean-Marc Pigeon)
Clement is an application working as an E-mail server firewall, its only purpose is to trap most of the unwanted mail (SPAM) at the e

Honeytrap 0.6.3.1 (by Tillmann Werner)
Honeytrap project trap attacks against tcp services

Robocode Robots 1.0 (by Marco Bakera)
Robocode Robots project is a collection of robocode robots.

This project is a collection of robots for the robocode framework


Other software in this category

SquirrelMail 1.5.1 (by The SquirrelMail Project Team)
SquirrelMail is a standards-based Webmail package written in PHP4

Tiki CMS/Groupware 1.9.7 (by Luis Argerich)

Downloader for X 2.5.7 (by Chuchelo)
Downloader for X is a tool for downloading files from the Internet via both HTT

Links 2.1pre26 (by Martin Pergel)
Links is graphics and text mode WWW browser, similar to Lynx

Mozilla Firefox 1.5.0.8 (by Mozilla Project)

    search


Featured Software

jEdit 4.3 pre8
jEdit is an Open Source text editor written in Java

Opera 9.02
Surf the Internet in a safer, faster, and easier way with Opera browser

GNU Aspell 0.60.4
GNU Aspell is a Free and Open Source spell checker designed to eventually replace Ispell


Subscribe in Rojo
Google Reader
Add to My Yahoo!

Add to My AOL
Subscribe with Bloglines
Subscribe in NewsGator Online
Add 'nixbit linux software' to Newsburst from CNET News.com
del.icio.us nixbit linux software


Top tags