Optical character recognition (OCR)
Recognise print text in images with any of these tools.
ABBYY FineReader is considered very good for OCR in many languages. However, it is not cheap.
Adobe Acrobat has OCR capabilities for PDFs.
Tesseract is open source and free to use. It has support for many languages, can be (re)trained for other languages and achieves good results. It requires a little knowledge of the command line and supports only a limited number of image formats for inputs.
Kraken / Ocropy¶
Kraken and Ocropy are related OCR applications, as they build on one another. They are both open source and free to use. Kraken is relatively easy to train. Instead of full pages, Kraken recognises text in single lines.
Calamari is also open source and free to use.