Sample Header Ad - 728x90

How to OCR a PDF file and get the text stored within the PDF?

102 votes
4 answers
79778 views
First, apologies if this has been asked before - I searched for a while through the existing posts, but could not find support. I am interested in a solution for Fedora to OCR a multipage non-searchable PDF and to turn this PDF into a new PDF file that contains the text layer on top of the image. On Mac OSX or Windows we could use Adobe Acrobat, but is there a solution on Linux, specifically on Fedora? [This](https://web.archive.org/web/20190807064639/https://snippets.webaware.com.au/howto/pdf-ocr-linux/) seems to describe a solution - but unfortunately I am already lost when retrieving exact-image.
Asked by ingli (2029 rep)
Aug 4, 2016, 03:39 PM
Last activity: Mar 27, 2025, 11:54 AM