Tesseract OCR 3.0

Tesseract OCR is a commercial-quality OCR engine originally developed at HP between 1985 and 1995.

What's new in Tesseract OCR 3.0:

  • Preparations for thread safety:
      • Changed TessBaseAPI methods to be non-static.
      • Created a class hierarchy for the directories to hold instance data, and began moving code into the classes.
      • Moved thresholding code to a separate class.
LICENSE TYPE:
The Apache License 2.0 
DEVELOPED BY:
Ray Smith and Tom
HOMEPAGE:
code.google.com
Tesseract OCR is a commercial-quality OCR engine originally developed at HP between 1985 and 1995. In 1995, this engine was among the top 3 evaluated by UNLV. It was open-sourced by HP and UNLV in 2005. The engine reads a binary, grey, or color image and outputs text. A built-in TIFF reader handles uncompressed TIFF images, and libtiff can be added to read compressed images.
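
Because the built-in reader only handles uncompressed TIFF images, it can help to check a file's compression before deciding whether libtiff is needed. The following is a minimal sketch, not part of Tesseract itself: it reads the standard TIFF Compression tag (259) from the first image directory of a classic TIFF file, where a value of 1 means uncompressed. The function names tiff_compression and is_uncompressed_tiff are hypothetical and chosen only for illustration.

```python
import struct

COMPRESSION_TAG = 259  # TIFF "Compression" tag number
NO_COMPRESSION = 1     # tag value meaning "uncompressed"


def tiff_compression(data: bytes) -> int:
    """Return the Compression value from the first IFD of a classic TIFF."""
    # The first two bytes give the byte order: "II" little-endian, "MM" big-endian.
    if data[:2] == b"II":
        endian = "<"
    elif data[:2] == b"MM":
        endian = ">"
    else:
        raise ValueError("not a TIFF file: bad byte-order mark")

    # Next come the magic number 42 and the offset of the first IFD.
    magic, ifd_offset = struct.unpack(endian + "HI", data[2:8])
    if magic != 42:
        raise ValueError("not a classic TIFF file: bad magic number")

    # The IFD starts with a 2-byte entry count, followed by 12-byte entries.
    (entry_count,) = struct.unpack(endian + "H", data[ifd_offset:ifd_offset + 2])
    for i in range(entry_count):
        start = ifd_offset + 2 + 12 * i
        tag, _field_type, _count = struct.unpack(
            endian + "HHI", data[start:start + 8]
        )
        if tag == COMPRESSION_TAG:
            # Compression is a SHORT stored in the first 2 bytes of the value field.
            (value,) = struct.unpack(endian + "H", data[start + 8:start + 10])
            return value

    # When the tag is absent, the TIFF default is 1 (no compression).
    return NO_COMPRESSION


def is_uncompressed_tiff(path: str) -> bool:
    """True if the file at `path` declares no compression in its first IFD."""
    with open(path, "rb") as f:
        return tiff_compression(f.read()) == NO_COMPRESSION
```

A file for which is_uncompressed_tiff returns False would, per the description above, need a build with libtiff. The sketch only inspects the first image in the file and does not handle BigTIFF.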

Supported Platforms

The developers regularly test on the following platforms:

  • Ubuntu 6.06 (x86/32, x86/64)
  • Ubuntu 6.10 (x86/32, x86/64)
  • Windows (x86/32)

Additionally, we believe the code should run on these other platforms, but we don't have the resources to test on them regularly:

  • Recent Linux distributions (x86/32, x86/64)
  • Mac OS X (x86, PPC)

If you're interested in supporting other platforms or languages, please get in touch with Ray Smith.

Last updated on October 4th, 2010
