pytesser - OCR library for Python (optical character recognition)

    Today I wandered around projects on Google and came across a rather interesting development - pytesser. According to the developer, recognize the text in the image. The library uses tesseract-ocr and PIL.

    Usage example: The big minus for me is the ability to use only on Windows. Therefore, I could not test this development. Included with the library for the test are the following image, which it should (in theory, once the author has placed it there) to recognize: I would like to hear the results of testing on other images from the habra-people. Official page: code.google.com/p/pytesser

    >>> from pytesser import *
    >>> image = Image.open('fnord.tif') # Open image object using PIL
    >>> print image_to_string(image) # Run tesseract.exe on image
    fnord
    >>> print image_file_to_string('fnord.tif')
    fnord






    image




    Also popular now: