The PICS label grabber application searches the WWW for PICS Labels and submits them to the label bureau of your choice. It can be used to populate a label bureau for testing, or for any other label bureau purpose.
The PICS label grabber allows the user to specify three URLs, and three numbers corresponding to the amount of searching the robot should do starting at each URL. The robot does a breadth-first search from each URL; i.e., all URLs immediately linked to the start URL ("child" URLs) are searched before any "grandchild" URLs. The robot does not visit a page twice. Once the robot visits a page, all PICS labels are extracted from that page and sent to a label bureau via an HTTP PUT.
Sends a standard PICS PUT to any label bureau. | |
Reads robots.txt at each site to avoid network congestion and respect the wishes of site webmasters. | |
100% Java; uses PICS Standard Library. |
where [address] is the hostname of the label bureau, [port] is the port number of the label bureau, and [resource] is the label bureau resource name.
Direct all questions regarding the PICS Robot to Kyle Jamieson jamieson@mit.edu. |