expee January 14, 2010 at 22:18

pytesser - OCR library for Python (optical character recognition)

Today I wandered around projects on Google and came across a rather interesting development - pytesser. According to the developer, recognize the text in the image. The library uses tesseract-ocr and PIL.

Usage example: The big minus for me is the ability to use only on Windows. Therefore, I could not test this development. Included with the library for the test are the following image, which it should (in theory, once the author has placed it there) to recognize: I would like to hear the results of testing on other images from the habra-people. Official page: code.google.com/p/pytesser

>>> from pytesser import *

>>> image = Image.open('fnord.tif') # Open image object using PIL

>>> print image_to_string(image) # Run tesseract.exe on image

fnord

>>> print image_file_to_string('fnord.tif')

fnord

Tags:

pytesser
python
ocr

pytesser - OCR library for Python (optical character recognition)

Also popular now: