To Do OCR in Python to This Image

Daniel2344 · October 28, 2021, 8:11pm

I am trying to do OCR in Python on this image (the number inside can change).
I have tried everything:
Tesseract
EasyOCR
but every method makes a lot of mistakes.

2233 copy

Daniel2344 · October 29, 2021, 1:35am

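import cv2
import pytesseract

# Load as grayscale (flag 0) and shrink to half size
imge = cv2.imread('2233.png', 0)
imge = cv2.resize(imge, None, fx=0.5, fy=0.5)
# --psm 7: treat the image as a single line of text
config = "--psm 7"
text = pytesseract.image_to_string(imge, config=config)
print(text)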

martin · October 29, 2021, 3:56pm

Hi @Daniel2344,

Keyboard Maestro has a native action for OCR.

However, when I tried the image you uploaded, I got an error:

Hi @peternlewis, what does this warning mean, and how should the image file be pre-processed to get OCR to work?

mrpasini · October 29, 2021, 4:27pm

You might find this discussion illuminating. It would lead you to invert the image for OCR so it's black on white.

peternlewis · October 30, 2021, 12:52am

The Tesseract library often produces that warning. Keyboard Maestro ignores it if anything else is produced; however, if nothing else is produced, it lets the warning through, since it might be relevant.

In this case, it's just not finding anything.

Even inverting it is insufficient: the circle around the outside needs to be removed, and the image needs to be inverted, for the text to be recognized.

Removing the circle is easily done with cropping.

Keyboard Maestro does not currently have any action to invert an image, I'm afraid.

mrpasini · October 30, 2021, 4:34am

No, but if the color scheme is caused by Dark Mode, that's easily addressed.

martin · October 31, 2021, 7:52pm

I can confirm that both inverting colors and cropping are needed.

Here are my tests (see the screenshot of my clipboard history; the order is from bottom to top):

First, with just the colors inverted, KM still shows the warning.
Screenshotting only the characters inside the circle, which I take to be equivalent to cropping, the characters are correctly OCRed → 25%.
Screenshotting the original, non-color-inverted characters (white characters on a black background), the OCR result is hey, not 25%.

This shows that both color-inverting and cropping are needed.
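
For the Python side of the original question, the same two steps (crop away the circle, then invert to black on white) could be sketched roughly as below. This is only a sketch under assumptions, not something tested on the uploaded image: the helper names invert, crop_center and ocr_image are made up for illustration, the keep=0.6 crop fraction is a guess that would need tuning so the circle falls outside the crop, and the text is assumed to sit in the middle of the picture. The file name 2233.png and the --psm 7 setting are taken from the code posted above.

```python
import numpy as np


def invert(gray: np.ndarray) -> np.ndarray:
    """Flip an 8-bit grayscale image so white-on-black becomes black-on-white."""
    return 255 - gray


def crop_center(gray: np.ndarray, keep: float = 0.6) -> np.ndarray:
    """Keep the central `keep` fraction of each dimension, dropping the border.

    `keep` is an assumed value; adjust it until the circle is cut off.
    """
    h, w = gray.shape[:2]
    dy = int(h * (1 - keep) / 2)
    dx = int(w * (1 - keep) / 2)
    return gray[dy:h - dy, dx:w - dx]


def ocr_image(path: str, keep: float = 0.6) -> str:
    """Crop, invert, then run Tesseract on the result (hypothetical helper)."""
    # Imported lazily so the two helpers above work without OpenCV/Tesseract.
    import cv2
    import pytesseract

    gray = cv2.imread(path, cv2.IMREAD_GRAYSCALE)
    if gray is None:
        raise FileNotFoundError(path)
    prepared = invert(crop_center(gray, keep))
    # --psm 7: treat the image as a single line of text
    return pytesseract.image_to_string(prepared, config="--psm 7").strip()
```

Usage would be something like print(ocr_image('2233.png')). If the result is still wrong, the crop fraction is the first thing to adjust, since the tests above suggest the circle is what confuses the OCR.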
