PDFTextStream

  2,382 downloads
2.6.0 Other/Proprietary License with Free Tr...
2.5/5 2
A PDF text and metadata extraction library available for Java, Python, and .NET.

description

download

specifications

changelog

buy now $1900.00  

PDFTextStream project is a PDF text and metadata extraction library available for Java, Python, and .NET.

It supports all versions of the PDF document specification, (including v1.6, used by Acrobat 7), extraction of text encoded using double-byte character sets (including Chinese, Japanese, and Korean), decryption of 40-bit and 128-bit encrypted documents, and extraction of all document metadata provided by PDF documents (including form data, bookmarks, and annotations).

Easy integration with Jakarta Lucene is included.
read more   
Last updated on August 10th, 2012

0 User reviews so far.

SUBMIT