OCR high res images & combine OCR data later, after image compression?
2
votes
0
answers
50
views
I have a large number of .tif's coming out of ScanTailor. Is there a way that I might OCR those .tif's with tesseract, holding the OCR data separate from the images; then compress the images, and finally combine the OCR data with the compressed images?
The point is that I don't want to compress before I OCR, and the tools for compressing the pdf's later, preserving the OCR, are not great.
Asked by Diagon
(740 rep)
Jul 7, 2023, 10:50 PM