Softpedia
 


LINUX CATEGORIES:



GLOBAL PAGES >>
NEWS ARCHIVE >>
SOFTPEDIA REVIEWS >>
MEET THE EDITORS >>
WEEK'S BEST
  • Linux Kernel 3.9.2 / 3....
  • LibreOffice 3.6.6 / 4.0.3
  • MPlayer 1.1.1
  • systemd 204
  • Arch Linux 2013.05.01
  • Blender 2.67
  • KDE Software Compilatio...
  • CrunchBang Linux Stable...
  • Elementary OS 0.1 / 0.2...
  • SystemRescueCd 3.6.0
  • Home > Linux > Programming > Perl Modules

    WWW::Robot 0.026

    Download button

    No screenshots available
    Downloads: 276  Tell us about an update
    User Rating:
    Rated by:
    NOT RATED
    0 user(s)
    Developer:

    License / Price:

    Last Updated:

    Category:
    Neil Bowers | More programs
    Perl Artistic License / FREE
    November 4th, 2008, 14:19 GMT
    ROOT / Programming / Perl Modules

     Read user reviews (0)  Refer to a friend  Subscribe

    WWW::Robot description

    A configurable web traversal engine

    WWW::Robot is a configurable web traversal engine (for web robots & agents).

    SYNOPSIS

     use WWW::Robot;
     
     $robot = new WWW::Robot(
     'NAME' => 'MyRobot',
     'VERSION' => '1.000',
     'EMAIL' => 'fred@foobar.com'
     );
     
     # ... configure the robot's operation ...
     
     $robot->run( 'http://www.foobar.com/' );


    This module implements a configurable web traversal engine, for a robot or other web agent. Given an initial web page (URL), the Robot will get the contents of that page, and extract all links on the page, adding them to a list of URLs to visit.

    Features of the Robot module include:

     * Follows the Robot Exclusion Protocol.
     * Supports the META element proposed extensions to the Protocol.
     * Implements many of the Guidelines for Robot Writers.
     * Configurable.
     * Builds on standard Perl 5 modules for WWW, HTTP, HTML, etc.

    A particular application (robot instance) has to configure the engine using hooks, which are perl functions invoked by the Robot engine at specific points in the control loop.

    The robot engine obeys the Robot Exclusion protocol, as well as a proposed addition. See "SEE ALSO" for references to documents describing the Robot Exclusion protocol and web robots.


    Product's homepage

    Requirements:

    · Perl

      


    TAGS:

    web traversal engine | web robots | web agents | Perl | web | robots

    Go to top

    WindowsGamesDriversMacLinuxScriptsMobileHandheldNews

    SUBMIT PROGRAM   |   ADVERTISE   |   GET HELP   |   SEND US FEEDBACK   |   RSS FEEDS   |   UPDATE YOUR SOFTWARE   |   ROMANIAN FORUM