OCR Image - Your Experiences with Text Recognition

appleianer · August 13, 2019, 10:51pm

First of all many thanks to @peternlewis for the successful update and the fair upgrade price

Since today we have the OCR Image action in version 9. What is your experience with text recognition?

Here is a comparison with the text recognition of the German language with Snagit:

Here my KM macro, if somebody would like to try it for himself:

Screenshot to OCR:Clipboard .kmmacros (2,1 KB)
37%20AM

peternlewis · August 14, 2019, 3:46am

It would be much easier to see the source image used, and the two resulting German OCRed text.

Not that it is likely to make any difference, Keyboard Maestro uses Tesseract, and the Tesseract training data. It is technically possible to train it yourself, but it is certainly possible that the German training data is inadequate.

That said, in my testing in English, it has worked surprisingly well for me, although I do admit I generally am using it on “perfect” text, captured from screen, rather than scanned from a magazine article.

Kazou · August 14, 2019, 8:19am

Thank you for the upgrade, and this macro, it's working fine/ for the few test I made in French Language.

ALYB · August 14, 2019, 10:45am

Peter, thanks for the great update (I'll certainly use the coupon).

A question about OCR'ing German texts: Does one have to download the German training data manually?

appleianer · August 14, 2019, 10:53am

Yes, you @ALYB can select your preferred language here and must download it the first time.

ScreenFlow

Here is the website (Github) of Tesseract Open Source

appleianer · August 14, 2019, 11:08am

Thank you for your feedback and Explain about OCR Recognition @peternlewis

Below (1) you can see the text recognition of KM. The result under (2) is from Snagit text recognition:

weberwa · August 14, 2019, 11:12am

Thank you, @appleianer , for sharing your OCR macro.
My first experience with screenshot OCR in German is excellent, exactly what I was searching for!
Walter

appleianer · August 14, 2019, 11:19am

Glad to hear it @weberwa. I will try again with a better picture. The result is already not bad (gif above).

It's definitely a faster solution than having to add more apps to your macro to get an OCR text.

The OCR solutions (AbbyyFine Reader, PDFpenPro etc.) don't always work 100%, either

BlackMac · August 14, 2019, 1:27pm

Hello everyone, this is my first contribution. I have only recently started working with Keyboard Maestro, but with great enthusiasm. What a unique software and a great community here!

The OCR function works perfectly, but not in every way. If a screenshot is made directly to the clipboard under Mojave, it doesn't work. The log says:

OCR Image cannot get target In macro “OCR Zwischenablage” (while executing OCR System Clipboard).
Assertion Failed: ainfo, file: …/Source/Actions/AOCRImage.mm:110, value: 0

However, when I save the screenshot as a file, open it in Preview and copy it to the clipboard, it works. Strange …

appleianer · August 14, 2019, 1:38pm

Hi @BlackMac, Try my macro which I have attached in this post.

BlackMac · August 14, 2019, 1:50pm

Hi appleianer

Your macro works fine, many thanks for your help. It’s not my preferred workflow, but it will do the trick.

But what is the problem with the regular OCR Image function shown in my screenshot? The macro looks as simple as it can be. Is it a bug?

peternlewis · August 14, 2019, 1:57pm

When you select the language for the first time, Keyboard Maestro will download it.

peternlewis · August 14, 2019, 2:02pm

Your macro looks fine, and I had no problems with the same action, taken from a screenshot in Mojave to the clipboard.

Before you run the macro, check your Clipboard History Switcher, and verify that the image is in the clipboard as expected, and has not been replaced by some text or something else.

BlackMac · August 14, 2019, 3:07pm

Thank you for your input. I restarted the Mac and immediately executed the macro, with the same result. Maybe this is just a problem on my machine. I hope a clean install with Catalina in September will solve the problem. In the meantime, I will find a workaround or use the solution from appleianer.

peternlewis · August 15, 2019, 6:18am

Before you run the macro, check your Clipboard History Switcher, and verify that the image is in the clipboard as expected, and has not been replaced by some text or something else.

Then use the built in OCR Image macro (via the gear menu on the image entry in the Clipboard History Switcher).

BlackMac · August 15, 2019, 2:53pm

Okay, this is interesting.

I copy a part of the screen with Command-Shift-4 to the clipboard.
I see this part when I switch to the Finder and use the “Show clipboard” command.
But the Clipboard History Switcher remains empty. No image.

Then:

When I open Preview and choose “New from clipboard” (or similar, I use the german localization), I can see the image.
When I copy this image with Command-A and Command-C, it appears immediately in the Clipboard History Switcher and the OCR command behind the gear icon works as expected.

Is this useful?

zachnfine · August 16, 2019, 12:09am

It's fantastic to have OCR built into Keyboard Maestro 9, and it's using 'Tesseract', the open-source OCR engine I'd previously hacked into some of my shortcuts using shell commands.

I've got a pattern and whitelist that I'd previously used with Tesseract to OCR SMPTE timecodes, and I'd love to be able to use this as a custom language inside Keyboard Maestro's OCR step. Is there any way to do this? I did some searching and tried plopping them into ~/Library/Application\ Support/Keyboard\ Maestro/Tesseract\ Trained\ Data/Custom, but that doesn't seem to have had any effect. I've attached those customization files in case anyone wants to try to get them to work.

timecode_for_tesseract.zip (2.3 KB)

HigashiFukai · August 16, 2019, 6:50am

Hi,

Wow! I made a few tests in Japanese and it works very well.

One problem, though, is that it works mainly for horizontal text.
I checked for a solution in Tesseract’s documentation, and it says that this option is available for vertical text recognition:
tesseract-ocr-jpn-vert

It would then be great to have the two choices in the macro menu:
tesseract-ocr-jpn for horizontal text
tesseract-ocr-jpn-vert for vertical text

peternlewis · August 16, 2019, 7:29am

I can't seem to duplicate this, but I do think I saw that sort of behaviour when I was testing something earlier, so it is possible there is a bug somewhere with the clipboard switcher display.

peternlewis · August 16, 2019, 7:32am

Just should be able to do it, but the file must be in aTesseract .traineddata file (and don't ask me what the format of that is or how you generate one, as I have no idea).

OCR Image - Your Experiences with Text Recognition

Options