web-archive-creator 0.04

web-archive-creator combines the power of wget with the convenience of the .war format, which Konqueror can browse. The Web ARchive format is simply a tarred and gzipped file containing the web files. The web-archive-creator script performs the download with wget and writes an index.html linking to the starting pages/URLs that wget fetched.
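To make the archive layout concrete, here is a minimal sketch of the packing step, not the script's actual code. The function name pack_war is hypothetical, the wget download is left out so the sketch runs offline, and the index.html here simply links to the original URLs, which may differ from what the real script writes.

```shell
#!/bin/sh
# Hypothetical helper (not part of web-archive-creator):
#   pack_war OUTPUT.war URL...
# Writes an index.html that links to each URL and packs the working
# directory into a gzipped tarball, which is all a .war file is.
pack_war() {
    out=$1
    shift
    workdir=$(mktemp -d) || return 1

    # The real script would call wget here to download into "$workdir".

    {
        printf '<html><body><ul>\n'
        for url in "$@"; do
            printf '<li><a href="%s">%s</a></li>\n' "$url" "$url"
        done
        printf '</ul></body></html>\n'
    } > "$workdir/index.html"

    # Pack the contents of the working directory, not the directory itself.
    tar -czf "$out" -C "$workdir" .
    status=$?

    rm -rf "$workdir"
    return $status
}
```

An archive built this way can be listed with tar -tzf FILE.war, which is also a quick way to inspect any existing .war file.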

Usage:

$ web-archive-creator -h

web-archive-creator

A small script to create a _Web _ARchive file using wget to download the site

web-archive-creator [-h|--help]                  # surprise: print this help
    [-d|--debug]                                 # enable debugging output
    [-v|--verbose]                               # be verbose to the screen
    [-o|--outputfile] FILENAME.war               # required!
    [ [-W|--wget-option] -some -wget -options .. -- ]    # important trailing '--' !
    [-l|--logfile] LOGFILE                       # write verbose output to LOGFILE!
    [-V|--version]                               # print version and exit
    http://www.url1.org ... http://finaly-one-more-url.net    # one url at least.

Examples:

Download only the given URLs

web-archive-creator -o WAR_FILE.war http://ulr1.org/hello.html \
    http://url2.net/some/dir/file.php http://ulr3.tv/file.html

Download only the given URLs, but be verbose

web-archive-creator -v -o WAR_FILE.war http://ulr1.org/hello.html \
    http://url2.net/some/dir/file.php http://ulr3.tv/file.html

Download the given URLs and all referenced pages, i.e. follow links one level deep

web-archive-creator -v -o WAR_FILE.war -W -r -l 1 -- http://ulr1.org/hello.html \
    http://url2.net/some/dir/file.php http://ulr3.tv/file.html

Download the given URLs and all referenced pages, following links one level deep, and also follow links to hosts other than the originating one

web-archive-creator -v -o WAR_FILE.war -W -H -r -l 1 -- \
    http://ulr1.org/hello.html http://url2.net/some/dir/file.php

As you can see, you can pass parameters directly to wget using -W parm1 parm2 ... parmN --
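The trailing '--' matters because it marks where the wget options end and the URLs begin. The following is a minimal sketch of how such a pass-through could be parsed, not the script's actual code. The function name parse_args and the variables wget_opts and urls are hypothetical, and only -W and plain URLs are handled here; the other options are left out for brevity.

```shell
#!/bin/sh
# Hypothetical sketch (not the script's actual code): collect everything
# between -W and the terminating '--' as wget options, and treat all
# remaining arguments as URLs.
parse_args() {
    wget_opts=""
    urls=""
    while [ $# -gt 0 ]; do
        case $1 in
            -W|--wget-option)
                shift
                # Everything up to the terminating '--' belongs to wget.
                while [ $# -gt 0 ] && [ "$1" != "--" ]; do
                    wget_opts="${wget_opts:+$wget_opts }$1"
                    shift
                done
                # Skip the '--' itself, if it was given.
                if [ $# -gt 0 ]; then
                    shift
                fi
                ;;
            *)
                urls="${urls:+$urls }$1"
                shift
                ;;
        esac
    done
}

parse_args -W -H -r -l 1 -- http://ulr1.org/hello.html http://url2.net/some/dir/file.php
echo "wget options: $wget_opts"
echo "URLs:         $urls"
```

Without the '--', this sketch would swallow the URLs as wget options too, which is why the help text calls the trailing '--' important. Joining arguments into one string is a simplification that breaks on options containing spaces.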

What's New in This Release:

· tidied up code
· rewritten and moved to a shell function:
  · local file recognition
  · writing index.html
  · wget call
· options:
  · -t|--tempdir: choose a temporary directory other than the system variable $TMP
  · -T|--type: choose the archive type to create (function not implemented yet!)
    · konqueror|kon → Konqueror
    · mozilla|moz → Mozilla
  · -o|--outputfile: is no longer mandatory. If no explicit filename is given, an archive is created in $TMP named $(basename $0)-$($DATE +%Y%m%d)-$($DATE +%H%M%S)
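The default-name rule for a missing -o can be sketched as follows. This is an illustration under assumptions, not the script's code: the function name default_war_name is hypothetical, plain date stands in for the script's $DATE variable, and the fallback to /tmp when $TMP is unset is an assumption of this sketch.

```shell
#!/bin/sh
# Hypothetical sketch of the default output name:
#   $TMP/<script name>-<YYYYmmdd>-<HHMMSS>
default_war_name() {
    script=$1
    tmpdir=${TMP:-/tmp}   # assumption: use /tmp when $TMP is unset
    printf '%s/%s-%s-%s\n' \
        "$tmpdir" \
        "$(basename "$script")" \
        "$(date +%Y%m%d)" \
        "$(date +%H%M%S)"
}

default_war_name "$0"
```

Because the name includes the time down to the second, repeated runs normally do not overwrite each other's archives.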

last updated on: January 5th, 2008, 18:36 GMT
price: FREE!
developed by: Ralf Beckesch
homepage: wrb.beckesch-it.de
license type: GPL v3
category: ROOT \ System \ Archiving
