I have huge trouble with a big collection of PDF+text files I have.
Since I work with mostly (sensitive) legal documents, and maintain an archive/database of them, that needs to be very searchable, I recently installed DevonThink Pro. With DevonThink you can do some advanced searches through your documents, and also PDF’s as long as they are searchable.
The problem that occurs is that DevonTHink can’t search inside the PDF+text files. After taking up the problem with them, they say it is an issue with Apple’s PDFkit. And that seems to be true, as when I open the PDFs in preview I’m also not able to select single words (it “selects” in multiple lines of text).
As it is impossible for me to print out all these documents in my archive, then scan them and OCR them to make them properly searchable, I wondered if you guys now a workaround to make these PDF+text files “flat” or “basis” or “simple” or whatever, so that I can OCR them and have get the searchability back.
Thank you in advance,
Thanks for your message, we have following products which can be used to convert normal PDF files to image based PDF files or searchable PDF files.
PDF to Text OCR Converter Command Line,
OCR to Any Converter Command Line,
Image to PDF OCR Converter Command Line,
You can use these products to convert from normal PDF files or scanned PDF files to searchable PDF files quickly, we hope above products will work fine to you.