Tesseract OCR

  6,367 downloads
Version 3.0, The Apache License 2.0

Tesseract OCR is a commercial-quality OCR engine originally developed at HP between 1985 and 1995. In 1995, it was among the top three engines evaluated by UNLV. HP and UNLV released it as open source in 2005. The engine reads a binary, grey or color image and outputs text. A built-in TIFF reader handles uncompressed TIFF images, and libtiff can be added to read compressed ones.
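As a rough illustration of the image-in, text-out flow described above, the following Python sketch wraps the `tesseract` command-line program, which is invoked as `tesseract imagename outputbase [-l lang]` and writes the result to `outputbase.txt`. The helper names `build_tesseract_command` and `run_ocr` and the file name `page.tif` are hypothetical, chosen only for this example; the sketch assumes the `tesseract` binary is installed and on the PATH.

```python
import shutil
import subprocess
import sys
from pathlib import Path


def build_tesseract_command(image_path, output_base, lang=None):
    """Build the argument list: tesseract <image> <outputbase> [-l <lang>].

    Hypothetical helper for this example; it only assembles the arguments.
    """
    cmd = ["tesseract", str(image_path), str(output_base)]
    if lang:
        cmd += ["-l", lang]
    return cmd


def run_ocr(image_path, output_base, lang=None):
    """Run tesseract on an image and return the recognized text.

    Returns None when the tesseract binary is not found on the PATH.
    """
    if shutil.which("tesseract") is None:
        return None
    subprocess.run(
        build_tesseract_command(image_path, output_base, lang), check=True
    )
    # Tesseract appends ".txt" to the output base name.
    return Path(str(output_base) + ".txt").read_text(encoding="utf-8")


if __name__ == "__main__":
    # "page.tif" is a placeholder default; pass a real image path instead.
    image = sys.argv[1] if len(sys.argv) > 1 else "page.tif"
    text = run_ocr(image, "page_out")
    print(text if text is not None else "tesseract not found on PATH")
```

Keeping the command construction separate from the subprocess call makes it possible to inspect the arguments without running the engine.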

Supported Platforms

The developers regularly test on the following platforms:

• Ubuntu 6.06 (x86/32, x86/64)
• Ubuntu 6.10 (x86/32, x86/64)
• Windows (x86/32)

Additionally, we believe the code should run on these other platforms, but we don't have the resources to test on them regularly:

• Recent Linux distributions (x86/32, x86/64)
• Mac OS X (x86, PPC)

If you're interested in supporting other platforms or languages, please get in touch with Ray Smith.
Last updated on October 4th, 2010
