W3C Robot

Webbot - The Libwww Mini Robot

The webbot is an example application that comes with libwww - the W3C Sample Code Library. It is a very fast Web walker with support for regular expressions and SQL logging facilities. It can be used to check links, find bad HTML, map out a web site, download images, etc.


The Basics

Author(s)
Henrik Frystyk Nielsen
Status
An example application for libwww. The robot is the primary tool used for the HTTP performance measurements.
Plans
New releases follow the releases of libwww with no exception.
Platforms
It runs on the same platforms as libwww
Getting the Source Code
The webbot comes as an integral part of the libwww distribution package.
Forums
Discussions on libwww and its example applications take place on <www-lib@w3.org> (archives). See the documentation for how to subscribe.

Features

How to Run It

Be careful - this is a robot and hence can be used to traverse many links - it should not be used for anything but HTTP/1.1 testing. Especially, it doesn't follow directions indicated in robot.txt files and hence should not be used on the Internet at large.

The robot has a large set of command line options that can be used in a large set of different combinations. You can try and see this simple script in order to see an example of how it can be run.


Henrik Frystyk Nielsen,
@(#) $Id: Overview.html,v 1.19 1998/05/04 19:38:19 frystyk Exp $