Softpedia
 


LINUX CATEGORIES:



GLOBAL PAGES >>
NEWS ARCHIVE >>
SOFTPEDIA REVIEWS >>
MEET THE EDITORS >>
WEEK'S BEST
  • BackTrack 5 R2
  • Wine 1.4 / 1.5.5
  • Mozilla Firefox 12...
  • Ubuntu 11.04
  • Angry Birds 1.1.2.1
  • Ubuntu 10.04.4 LTS
  • Linux Kernel 3.4
  • Ubuntu Manual 10.10
  • Adobe Flash Player...
  • Pidgin 2.10.4
  • Home > Linux > Internet > HTTP (WWW)

    spydey 0.5

    Download button

    No screenshots available
    Downloads: 147  Tell us about an update
    User Rating:
    Rated by:
    NOT RATED
    0 user(s)
    Developer:

    License / Price:

    Last Updated:

    Category:
    Paul M. Winkler | More programs
    MIT/X Consortium Lic... / FREE
    February 15th, 2012, 01:43 GMT [view history]
    ROOT / Internet / HTTP (WWW)

     Read user reviews (0)  Refer to a friend  Subscribe

    spydey description

    A simple web spider with pluggable recursion strategies

    spydey is a simple web spider with several recursion strategies.

    It doesn't do much except follow links and report status. I mostly use it for quick and dirty smoke testing and link checking.

    The only unusual feature is the --traversal=pattern option, which does recursive traversal in an unusual order: It tries to recognize patterns in URLs, and will follow URLs of novel patterns before those with patterns it has seen before. If you use this for smoke-testing a typical modern web app, it will very quickly hit all your views/controllers at least once... usually.

    Also, it's designed so that adding a new recursion strategy is trivial. Spydey was originally written for the purpose of experimenting with different recursive crawling strategies. Read the source.

    Oh, and if you install Fabulous, console output is in color.

    For smoke testing, I typically run it like:

    spydey -r --max-requests=100 --traversal=pattern --profile --log-referrer URL

    There are a number of other command-line options, many stolen from wget. Use --help to see what they are.


    Product's homepage

    Requirements:

    · Python

    What's New in This Release: [ read full changelog ]

    · Remove useless pattern stats unless --stats is given
    · Fix to prevent spanning hosts when following redirects, unless -H is on.

      


    TAGS:

    python library | web spider | recursion strategies | library | python | spider



    HTML code for linking to this page:


    Go to top

    WindowsGamesDriversMacLinuxScriptsMobileHandheldNews

    SUBMIT PROGRAM   |   ADVERTISE   |   GET HELP   |   SEND US FEEDBACK   |   RSS FEEDS   |   UPDATE YOUR SOFTWARE   |   ROMANIAN FORUM