Now uses the Tesseract engine by default!
Optical Character Recognition is a GUI application and CLI script for Nautilus/GNOME. The script converts almost any image of text (PDFs excepted) into editable text.
For OCR, the preferred input formats are those of the TIFF and PNM families: .tiff .tif .pnm .pbm .pgm .ppm.
Compatible formats are: .jpg .jpeg .gif .png .bmp .xcf .pct .pict.
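The two format lists above can be expressed as a simple extension lookup. The following is only a minimal sketch in Python, not the script's actual code: the function name classify_format and the labels "preferred", "compatible" and "unsupported" are hypothetical and chosen for illustration, and the case-insensitive matching is an assumption.

```python
from pathlib import Path

# Extensions taken from the two lists in this description.
PREFERRED = {".tiff", ".tif", ".pnm", ".pbm", ".pgm", ".ppm"}
COMPATIBLE = {".jpg", ".jpeg", ".gif", ".png", ".bmp", ".xcf", ".pct", ".pict"}


def classify_format(filename: str) -> str:
    """Return a hypothetical label for a file based on its extension.

    Matching is case-insensitive (an assumption made for this sketch).
    """
    ext = Path(filename).suffix.lower()
    if ext in PREFERRED:
        return "preferred"
    if ext in COMPATIBLE:
        return "compatible"
    # Anything else, including PDF, falls outside both lists.
    return "unsupported"


if __name__ == "__main__":
    for name in ("scan.tif", "photo.jpeg", "manual.pdf"):
        print(name, "->", classify_format(name))
```

A check like this could run before an image is handed to the recognition engine, so that files outside both lists (PDFs, for example) are rejected early.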
The recognition engines used have a very high character recognition success rate compared to other OCR programs, including proprietary software.
Product's homepage
Requirements:
· Nautilus