PDFResurrect is a tool aimed at analyzing PDF documents. The PDF format allows for previous document changes to be retained in a more recent version of the document, thereby creating a running history of changes for the document. This tool attempts to extract all previous versions while also producing a summary of changes between versions.
PDFResurrect can also "scrub" or write data over the original instances of PDF objects that have been modified or deleted, in an effort to disguise information from previous versions that might not be intended for anyone else to read.
This project is released under the GNU GPLv3 license. So have at it!
Many individuals were consulted on this and provided suggestions, including Tele, Remad, Derez, Count, and Sunpuke. Special thanks to Brent, not really part of the 757 crew, but aided in proofreading the paper. Thanks guys!
What's New in This Release: [ read full changelog ]
· The main fix regards how the tool was locating the EOF token which the PDF writers place at the end of different versions of PDFs.
· Previously, if an EOF token was split across a 256-byte boundary, then Mr. Resurrect would not have found it.
· The new algorithm might be a tad slower, but is more precise.