How to OCR a PDF file and get the text stored within the PDF?
102 votes, 4 answers, 79778 views
First, apologies if this has been asked before. I searched through the existing posts for a while but could not find an answer.
I am looking for a way on Fedora to OCR a multipage, non-searchable PDF and turn it into a new PDF file that contains a text layer on top of the page images. On Mac OS X or Windows we could use Adobe Acrobat, but is there a solution on Linux, specifically on Fedora?
[This](https://web.archive.org/web/20190807064639/https://snippets.webaware.com.au/howto/pdf-ocr-linux/) seems to describe a solution, but unfortunately I am already stuck at the step of retrieving exact-image.
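To make the goal concrete, here is a minimal sketch of the kind of workflow I have in mind. It assumes the OCRmyPDF command-line tool (`ocrmypdf`) is installed and on the `PATH`; that tool is my assumption and is not part of the linked how-to. The helper names `build_ocr_command` and `ocr_pdf` and the file names `scan.pdf` and `scan_ocr.pdf` are hypothetical.

```python
import shutil
import subprocess
from pathlib import Path


def build_ocr_command(src, dst, language="eng"):
    """Return the argument list for an ocrmypdf run (assumed tool, see above)."""
    return [
        "ocrmypdf",
        "-l", language,   # Tesseract language code(s), e.g. "eng" or "eng+deu"
        "--skip-text",    # skip pages that already contain a text layer
        str(src),         # input: image-only (non-searchable) PDF
        str(dst),         # output: same pages with a text layer added
    ]


def ocr_pdf(src, dst, language="eng"):
    """Run ocrmypdf on src and write the searchable PDF to dst."""
    if shutil.which("ocrmypdf") is None:
        raise FileNotFoundError("ocrmypdf was not found on PATH")
    # check=True raises CalledProcessError if the tool exits with an error.
    subprocess.run(build_ocr_command(src, dst, language), check=True)
    return Path(dst)


# Hypothetical usage:
#   ocr_pdf("scan.pdf", "scan_ocr.pdf")
```

If something like this works, the output should look the same as the input but allow text to be selected and searched. Whether the tool and the matching Tesseract language data are packaged for a given Fedora release would need to be checked.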
Asked by ingli (2029 rep) on Aug 4, 2016, 03:39 PM
Last activity: Mar 27, 2025, 11:54 AM