OCRopus 0.4

OCRopus is an open source document analysis and OCR system.
OCRopus is an open source document analysis and OCR system.

OCRopus is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities.

The OCRopus engine is based on two research projects: a high-performance handwriting recognizer developed in the mid-90's and deployed by the US Census bureau, and novel high-performance layout analysis methods.

OCRopus is development is sponsored by Google and is initially intended for high-throughput, high-volume document conversion efforts. We expect that it will also be an excellent OCR system for many other applications.

Note: To use the application, run "ocropus-batch png" from the directory containing the text images.

last updated on:
June 15th, 2009, 14:21 GMT
license type:
The Apache License 2.0 
developed by:
Hagen Kaprykowsky
ROOT \ Multimedia \ Graphics
Download Button

In a hurry? Add it to your Download Basket!

user rating 4



Rate it!
What's New in This Release:
  • OCRopus has been turned into a library
  • there is a new set of command line programs for book-level recognition
  • there is a new line recognizer
  • there is a new component model
read full changelog

Add your review!