web-archive-creator

  895 downloads
0.04 GPL v3    
3.3/5 19
web-archive-creator can be used to join the power of wget and the usability of the .war format, which Konqueror is able to brows

description

download

specifications

web-archive-creator can be used to join the power of wget and the usability of the .war format, which Konqueror is able to browse. The Web ARrchive format is simply a tarred and gzipped file containing the Web files. The web-archive-creator script performs the download with wget and writes an index.html linking to the starting pages/URLs wget fetched.

Usage:

$ web-archive-creator -h

web-archive-creator

A small script to create a _Web _ARchive file using wget to download the site

web-archive-creator [-h|--help]
# surprise: print this help
[-d|--debug]
# enable debugging output
[-v|--verbose]
# be verbose to the screen
[-o|--outputfile] FILENAME.war
# required!
[ [-W|--wget-option] -some -wget -options .. -- ]
# important trailing '--' !
[-l|--logfile] LOGFILE
# write verbose output to LOGFILE!
[-V|--version]
# print version and exit
http://www.url1.org ... http://finaly-one-more-url.net
# one url at least.

Examples:

Download only the given URLs

web-archive-creator -o WAR_FILE.war http://ulr1.org/hello.html

http://url2.net/some/dir/file.php http://ulr3.tv/file.html

Download only the given URLs, but be verbose

web-archive-creator -v -o WAR_FILE.war http://ulr1.org/hello.html

http://url2.net/some/dir/file.php http://ulr3.tv/file.html

Download only the given URLs and all refernced pages, so dig one link in depth

web-archive-creator -v -o WAR_FILE.war -W -r -l 1 -- http://ulr1.org/hello.html

http://url2.net/some/dir/file.php http://ulr3.tv/file.html

Download only the given URLs and all refernced pages, so dig one link in depth. Span to links != originating host!

web-archive-creator -v -o WAR_FILE.war -W -H -r -l 1 --

http://ulr1.org/hello.html http://url2.net/some/dir/file.php

You see, you can give parameters to wget directly, using -W parm1 parm2 ..parmN –

What's New in This Release:

· tidied up code

rewritten and moved to a shell function
· local file recognition
· writing index.html
· wget call

options
· -t|--tempdir: choose other temporary directory than sytsem variable $TMP
· -T|--type: choose archive type to create function not implemented yet!!!
· konqueror|kon → Konqueror
· mozilla|moz → Mozilla
· -o|–outputfile: is no more mandatory. If an explicit filename is missing, an archive will be created in $TMP named $(basename $0)-$($DATE +%Y%m%d)-$($DATE +%H%M%S)
READ MORE   
Last updated on January 5th, 2008
1  
web-archive-creator

0 User reviews so far.

SUBMIT