PDFTextStream

2.6.0 Other/Proprietary License with Free Tr...
2.5/5 2

description

specifications

changelog

BUY $1900.00

PDFTextStream project is a PDF text and metadata extraction library available for Java, Python, and .NET.

It supports all versions of the PDF document specification, (including v1.6, used by Acrobat 7), extraction of text encoded using double-byte character sets (including Chinese, Japanese, and Korean), decryption of 40-bit and 128-bit encrypted documents, and extraction of all document metadata provided by PDF documents (including form data, bookmarks, and annotations).

Easy integration with Jakarta Lucene is included.
read more   
Last updated on August 10th, 2012

0 User reviews so far.

SUBMIT
A PDF text and metadata extraction library available for Java, Python, and .NET.

  2,382 downloads

#metadata extraction #PDF text extraction #PDF library #PDFTextStream #PDF #text #metadata