I am trying out the pdftotxtocr.exe and was wondering if there is a switch to disable OCR processing. I tried running without the "-ocr" switch but the program still proceed to step "This PDF file seems not contain text contents. We will use OCR technology to recognize this PDF file continue..."
Thanks for your message, the free trial version of PDF to Text OCR Converter hasn't an option to disable OCR function, however, after you purchased it, please email to us your Order ID, we will send a new version to you, the new version has a "-disableocr" parameter which can be used to disable OCR function completely.
Thanks for your response. I will send you the order ID soon.
Another question. I am trying to extract text from the attached scanned sketch but got garbled result. What should be correct switches in order to have the best OCR recognition results?
Our OCR engine doesn't support handwritten characters, it is support printed characters only, so it can't convert handwritten characters in your PDF file into editable Word document, please notice this matter.
If the OCR engine does not support handwritten characters, then how come the program still generate output file with many non alphanumeric characters (see attached)? Is there a way to instruct the program to simply output a blank txt file in this situation?
Our OCR engine does not support handwritten characters, it will create garbage characters from handwritten characters, we haven't a way to simply output a blank txt file in this situation, sorry for this matter.