GNU Wget is a free software package for retrieving files using HTTP, HTTPS and FTP, the two most widely-used Internet protocols.
Wget is a non-interactive commandline tool, so it may easily be called from scripts, cron jobs, terminals without Xsupport, etc.
Here are some key features of "GNU Wget":
· Can resume aborted downloads, using REST and RANGE
· Can use filename wild cards and recursively mirror directories
· NLS-based message files for many different languages
· Optionally converts absolute links in downloaded documents to relative, so that downloaded documents may link to each other locally
· Runs on most UNIX-like operating systems as well as Microsoft Windows
· Supports HTTP and SOCKS proxies
· Supports HTTP cookies
· Supports persistent HTTP connections
· Unattended / background operation
· Uses local file timestamps to determine whether documents need to be re-downloaded when mirroring
· GNU wget is distributed under the GNU General Public License.
What's New in This Release: [ read full changelog ]
· SECURITY FIX: It had been possible to trick Wget into accepting SSL certificates that don't match the host name, through the trick of embedding NUL characters into the certs' common name. Fixed by Joao Ferreira.
Added support for CSS. This includes:
· Parsing links from CSS files, and from CSS content found in HTML style tags and attributes.
· Supporting conversion of links found within CSS content, when --convert-links is specified.
· Ensuring that CSS files end in the ".css" filename extension, when --convert-links is specified.
· CSS support in Wget is thanks to Ted Mielczarek.
· Added support for Internationalized Resource Identifiers (IRIs, RFC 3987). When support is enabled (requires libidn and libiconv), links with non-ASCII bytes are translated from their source encoding to UTF-8 before percent-encoding. IRI support was added by Saint Xavier, as his project for the Google Summer of Code.
· Wget now provides more sensible exit status codes when downloads don't proceed as expected (see the manual).
· --default-page option (and associated wgetrc command) added to support alternative default names for index.html.
· --ask-password option (and associated wgetrc command) added to support password prompts at the console.
· The --input-file option now also handles retrieving links from an external file.
· The output generated by the --version option now includes information on how it was built, and the set of configure-time options that were selected.
· --html-extension has been renamed to --adjust-extension, to reflect the fact that it now also applies to CSS content. --html-extension is still acceptable, but is now deprecated.
· An "ascii" specifier is now accepted by --restrict-file-names, which forces the percent-encoding of all non-ASCII bytes
· Several previously existing, but undocumented .wgetrc options are now documented: save_headers, spider, and user_agent, auth_no_challenge, and keep_session_cookies. Also added documentation for the "lowercase" and "uppercase" values for --restrict-file-names, which had been present since Wget 1.11.