Softpedia
 


LINUX CATEGORIES:



GLOBAL PAGES >>
NEWS ARCHIVE >>
SOFTPEDIA REVIEWS >>
MEET THE EDITORS >>
WEEK'S BEST
  • Linux Kernel 3.9.3 / 3....
  • LibreOffice 3.6.6 / 4.0.3
  • MPlayer 1.1.1
  • systemd 204
  • Arch Linux 2013.05.01
  • Blender 2.67
  • KDE Software Compilatio...
  • CrunchBang Linux Stable...
  • Elementary OS 0.1 / 0.2...
  • SystemRescueCd 3.6.0
  • Home > Linux > Programming > Libraries

    warc 0.2.1

    Download button

    No screenshots available
    Downloads: 231  Tell us about an update
    User Rating:
    Rated by:
    NOT RATED
    0 user(s)
    Developer:

    License / Price:

    Last Updated:

    Category:
    Anand Chitipothu | More programs
    BSD License / FREE
    May 16th, 2012, 10:53 GMT
    ROOT / Programming / Libraries

     Read user reviews (0)  Refer to a friend  Subscribe

    warc description

    Python library to work with WARC files

    warc (Web ARChive) is a file format for storing web crawls.

    http://www.scribd.com/doc/4303719/WARC-ISO-28500-final-draft-v018-Zentveld-080618

    This warc library makes it very easy to work with WARC files.:

    import warc
    f = warc.open("test.warc")
    for record in f:
     print record['WARC-Target-URI'], record['Content-Length']


    Documentation

    The documentation of the warc library is available at http://readthedocs.org/docs/warc/en/latest/


    Product's homepage

    Requirements:

    · Python

      


    TAGS:

    WARC handler | Python library | web crawler | Python | WARC | handler

    Go to top

    WindowsGamesDriversMacLinuxScriptsMobileHandheldNews

    SUBMIT PROGRAM   |   ADVERTISE   |   GET HELP   |   SEND US FEEDBACK   |   RSS FEEDS   |   UPDATE YOUR SOFTWARE   |   ROMANIAN FORUM