How to OCR a PDF file and get the text stored within the PDF?

102 votes

4 answers

79778 views

                          First, apologies if this has been asked before - I searched for a while through the existing posts, but could not find support.

I am interested in a solution for Fedora to OCR a multipage non-searchable PDF and to turn this PDF into a new PDF file that contains the text layer on top of the image. On Mac OSX or Windows we could use Adobe Acrobat, but is there a solution on Linux, specifically on Fedora?

[This](https://web.archive.org/web/20190807064639/https://snippets.webaware.com.au/howto/pdf-ocr-linux/)  seems to describe a solution - but unfortunately I am already lost when retrieving exact-image.

Asked by ingli (2029 rep)

Aug 4, 2016, 03:39 PM
Last activity: Mar 27, 2025, 11:54 AM

How to OCR a PDF file and get the text stored within the PDF?

Related Questions