KHTMLScrub is a program that removes unwanted tags and attributes from html files.
This program came about from a simple request at Groklaw about a GUI frontend for html_scrub.
I started coding one up, but had problems making my program behave as I wanted it too. So after a couple of re-writes, I ended up implementing on its own.
Since 0.3 KHTMLScrub is a Kpart application, which means it can be used as a plugin for Quanta.
$ tar khtmlscrub-version.tar.gz
$ cd khtmlscrub-version
# make install
The basic operation of KHTMLSCrub is very simple. You set the options you want. Open the document and hit "Apply options" then KHTMLScrub will process the document for you, and provide a HTML preview and a Document Source preview so you can see exactly what KHTMLScrub has done.
You can play around with the options, and keep hitting apply until your happy with the result. Then simply save the document. You can also save and load the options, as well as set them as defaults.
KHTMLScrub tries as much as possible to save the original formatting of the HTML source. Including the case of tags and attributes, even though it uses case insensitive searches itself.