web-archive-creator 0.04

web-archive-creator can be used to join the power of wget and the usability of the .war format, which Konqueror is able to brows

  Add it to your Download Basket!

 Add it to your Watch List!

0/5

Rate it!
send us
an update
LICENSE TYPE:
GPL v3 
USER RATING:
3.3/5 19
DEVELOPED BY:
Ralf Beckesch
HOMEPAGE:
wrb.beckesch-it.de
CATEGORY:
ROOT \ System \ Archiving
1 web-archive-creator Screenshot:
web-archive-creator
web-archive-creator can be used to join the power of wget and the usability of the .war format, which Konqueror is able to browse. The Web ARrchive format is simply a tarred and gzipped file containing the Web files. The web-archive-creator script performs the download with wget and writes an index.html linking to the starting pages/URLs wget fetched.

Usage:

$ web-archive-creator -h

web-archive-creator

A small script to create a _Web _ARchive file using wget to download the site

web-archive-creator [-h|--help]
# surprise: print this help
[-d|--debug]
# enable debugging output
[-v|--verbose]
# be verbose to the screen
[-o|--outputfile] FILENAME.war
# required!
[ [-W|--wget-option] -some -wget -options .. -- ]
# important trailing '--' !
[-l|--logfile] LOGFILE
# write verbose output to LOGFILE!
[-V|--version]
# print version and exit
http://www.url1.org ... http://finaly-one-more-url.net
# one url at least.

Examples:

Download only the given URLs

web-archive-creator -o WAR_FILE.war http://ulr1.org/hello.html

http://url2.net/some/dir/file.php http://ulr3.tv/file.html

Download only the given URLs, but be verbose

web-archive-creator -v -o WAR_FILE.war http://ulr1.org/hello.html

http://url2.net/some/dir/file.php http://ulr3.tv/file.html

Download only the given URLs and all refernced pages, so dig one link in depth

web-archive-creator -v -o WAR_FILE.war -W -r -l 1 -- http://ulr1.org/hello.html

http://url2.net/some/dir/file.php http://ulr3.tv/file.html

Download only the given URLs and all refernced pages, so dig one link in depth. Span to links != originating host!

web-archive-creator -v -o WAR_FILE.war -W -H -r -l 1 --

http://ulr1.org/hello.html http://url2.net/some/dir/file.php

You see, you can give parameters to wget directly, using -W parm1 parm2 ..parmN –

What's New in This Release:

· tidied up code

rewritten and moved to a shell function
· local file recognition
· writing index.html
· wget call

options
· -t|--tempdir: choose other temporary directory than sytsem variable $TMP
· -T|--type: choose archive type to create function not implemented yet!!!
· konqueror|kon → Konqueror
· mozilla|moz → Mozilla
· -o|–outputfile: is no more mandatory. If an explicit filename is missing, an archive will be created in $TMP named $(basename $0)-$($DATE +%Y%m%d)-$($DATE +%H%M%S)

Last updated on January 5th, 2008

#web archiver #web backup #website archiver #Konqueror #wget #web #archiver

Add your review!

SUBMIT