PDFTextStream 2.6.0

A PDF text and metadata extraction library available for Java, Python, and .NET.
PDFTextStream project is a PDF text and metadata extraction library available for Java, Python, and .NET.

It supports all versions of the PDF document specification, (including v1.6, used by Acrobat 7), extraction of text encoded using double-byte character sets (including Chinese, Japanese, and Korean), decryption of 40-bit and 128-bit encrypted documents, and extraction of all document metadata provided by PDF documents (including form data, bookmarks, and annotations).

Easy integration with Jakarta Lucene is included.

last updated on:
August 10th, 2012, 10:14 GMT
price:
$1900.00
 
developed by:
Snowtide Informatics Systems, Inc.
homepage:
snowtide.com
license type:
Other/Proprietary License with Free Tr...
category:
ROOT \ Information Management

In a hurry? Add it to your Download Basket!

user rating 2

2.5/5
 

0/5

What's New in version 2.3.2
  • This version includes a variety of fixes made to ensure PDFTextStream is capable of extracting text from PDF documents that are nonconforming to the PDF specification.
  • It also includes a variety of performance enhancements.
read full changelog

Add your review!

SUBMIT