Extract date from scanned document PDFs (letters, invoices etc.)

Hey @Marcs,

This is a fairly simple task, but doing it well requires some tinkering.

Read at least the first post here:

Keyboard Maestro “Convert PDF Files into Text Files in the Front Finder Window” Macro

You need the 64-bit executables from here:

Download Xpdf and XpdfReader

Once they're installed, you can feed the path of any given PDF file to an Execute a Shell Script action to extract structured text from it:

pdftotext -layout "<POSIX_Path_of_Your_File>" -

From there you can go to town.
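
As a rough starting point for the date part, here is a minimal sketch that pipes the extracted text through grep and keeps the first date-like string. The function name first_date and the date patterns are my own assumptions, not part of Xpdf or the linked macro. It only recognizes numeric dates such as 15.04.2023, 12/03/21, or 2023-04-15, and it assumes the scanned PDF already has an OCR text layer, since pdftotext cannot read text out of a pure image.

```shell
#!/bin/sh
# Hypothetical helper (not part of Xpdf): read text on stdin and print
# the first numeric date-like string, or nothing if none is found.
# Matches D.M.YYYY, D/M/YY, D-M-YYYY style dates and ISO YYYY-MM-DD.
# Uses grep -o, which is available in both macOS and GNU grep.
first_date() {
  grep -E -o '[0-9]{1,2}[./-][0-9]{1,2}[./-][0-9]{2,4}|[0-9]{4}-[0-9]{2}-[0-9]{2}' \
    | head -n 1
}

# Example use in an Execute a Shell Script action (the path is a placeholder):
# pdftotext -layout "/path/to/your/scan.pdf" - | first_date
```

Dates with written-out month names, or documents where the first date is not the one you want (a due date ahead of the invoice date, say), would need extra patterns or some filtering on the surrounding words.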

-Chris
