What is the best software to convert non-searchable PDF files to searchable PDF files with OCR?

>>We want to directly convert non-searchable PDF to searchable PDF.
>>What is the best tool to convert nonsearchable.pdf to searchable.pdf?
>>We have found that "image2pdf_cmd_ocr_v5.0" can do this, but the resulting searchable PDFs do not meet our requirements.

"VeryPDF OCR to Any Converter Command Line" is a good software to convert nonsearchable.pdf to searchable.pdf, you may download it from following web page to try,

https://www.verypdf.com/app/ocr-to-any-converter-cmd/try-and-buy.html

The user guide is available on the following page:

https://www.verypdf.com/app/ocr-to-any-converter-cmd/user-guide.html

You can run the following command lines to convert your PDF files to searchable PDF files:

ocr2any.exe -ocr -ocrmode 4 -res 72 _test_color.pdf _test-pdf2pdf-grayscale.pdf
ocr2any.exe -ocr -ocrmode 4 -res 72 -bitcount 24 _test_color.pdf _test-pdf2pdf-color.pdf

-ocrmode <int> : set OCR mode, the value can be selected from 0 to 4,
        -ocrmode 0: output to text file
        -ocrmode 1: OCR PDF pages and insert new text layer under original PDF pages
        -ocrmode 2: output to plain text based PDF file
        -ocrmode 3: output to OCRed PDF file (BW) with hidden text layer
        -ocrmode 4: output to OCRed PDF file (Color) with hidden text layer
 
ocr2any.exe -ocr -ocrmode 1 test_multi_columns.pdf  _test_multi_columns_mode1.pdf
ocr2any.exe -ocr -ocrmode 2 test_multi_columns.pdf  _test_multi_columns_mode2.pdf
ocr2any.exe -ocr -ocrmode 3 test_multi_columns.pdf  _test_multi_columns_mode3.pdf
ocr2any.exe -ocr -ocrmode 3 -ownerpwdout 123 -keylen 2 -encryption 3900 test_multi_columns.pdf  _test_multi_columns_mode3_encryption.pdf
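For batch jobs, command lines like the ones above can be generated from a script. The following is only a minimal Python sketch, not part of the VeryPDF documentation: the helper names build_ocr2any_command and convert are hypothetical, it assumes ocr2any.exe is on the PATH, and it uses only the options shown above (-ocr, -ocrmode, -res, -bitcount).

```python
import subprocess
from typing import List, Optional

# OCR modes listed in the option description above (0 to 4).
OCR_MODES = {0, 1, 2, 3, 4}


def build_ocr2any_command(src: str, dst: str, ocrmode: int = 4,
                          res: Optional[int] = None,
                          bitcount: Optional[int] = None,
                          exe: str = "ocr2any.exe") -> List[str]:
    """Build an argument list using only options that appear in this post."""
    if ocrmode not in OCR_MODES:
        raise ValueError(f"-ocrmode must be between 0 and 4, got {ocrmode}")
    cmd = [exe, "-ocr", "-ocrmode", str(ocrmode)]
    if res is not None:
        cmd += ["-res", str(res)]
    if bitcount is not None:
        cmd += ["-bitcount", str(bitcount)]
    cmd += [src, dst]
    return cmd


def convert(src: str, dst: str, **options) -> int:
    """Run the converter and return its exit code (needs ocr2any.exe installed)."""
    return subprocess.run(build_ocr2any_command(src, dst, **options)).returncode
```

Calling build_ocr2any_command("_test_color.pdf", "_test-pdf2pdf-color.pdf", res=72, bitcount=24) reproduces the arguments of the second command at the top of this post. Passing the arguments as a list avoids quoting problems with paths that contain spaces.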

>>1>We have created two jpg files and created searchable PDF with two input images.

img2pdf.exe -x 1 -o c:\xampp\htdocs\verypdf.com\image2pdf_cmd_ocr_v5.0\samplefiles\output\1_searchable_with_two_input_images.pdf c:\xampp\htdocs\verypdf.com\image2pdf_cmd_ocr_v5.0\samplefiles\input\p1.jpg c:\xampp\htdocs\yellowfolder\verypdf.com\image2pdf_cmd_ocr_v5.0\samplefiles\input\p2.jpg

Output "1_searchable_with_two_input_images.pdf" is 3.31 MB and good quality.

>>2>We have a non-searchable PDF and have converted it to a searchable PDF.

img2pdf.exe -x 1 -o c:\xampp\htdocs\verypdf.com\image2pdf_cmd_ocr_v5.0\samplefiles\output\1_searchable_with_pdf.pdf c:\xampp\htdocs\verypdf.com\image2pdf_cmd_ocr_v5.0\samplefiles\input\nonsearchable.pdf

Output "1_searchable_with_pdf.pdf" is only 458 KB and very bad quality.

>>Please find our attached zip file with input and output folder.

We can't find the attached zip file in this ticket. Could you please resend it to us?

>>Please advise what we need to do to achieve the following.
>>Also, correct us if we have identified the wrong tool for the functionality below.
>>1>Multiple Images to Multi page PDF
>>https://www.verypdf.com/app/image-to-pdf-ocr-converter/try-and-buy.html
>>Or any other best tool.

Yes.

If you want to convert multiple images to a multi-page PDF with the OCR option, you can also use our "VeryPDF OCR to Any Converter Command Line" software, which has more functions than the "Image to PDF OCR Command Line" software:

https://www.verypdf.com/app/ocr-to-any-converter-cmd/try-and-buy.html

>>2>Multipage PDF to Multiple Images (no of pages in PDF , same number of images)
>>https://www.verypdf.com/app/pdf-to-image-converter/try-and-buy.html
>>Or any other best tool.

Yes.

PDF to Image Converter Command Line is the best software for converting PDF files to image files.

>>3>Metatags and custom tags to PDF
>>https://www.verypdf.com/app/advanced-pdf-tools/try-and-buy.html
>>Or any other best tool.

Yes, Advanced PDF Tools Command Line is the best software to modify tags in PDF files.

>>4>PDF to txt OR images to txt
>>pdf2txtocrcmd and ocr2any_cmd --> both giving same output , which is best for us?

If you just want to convert PDF files to text files, pdf2txtocrcmd is enough.

If you want to convert PDF files to Text, Word, Excel, HTML, PDF, and other formats, choose ocr2any_cmd. It generates not only text files but also many other formats.

>>5>Non searchable to Searchable PDF directly.
>>Please suggest best tool.

"VeryPDF OCR to Any Converter Command Line" is a good software to convert nonsearchable.pdf to searchable.pdf, you may download it from following web page to try,

https://www.verypdf.com/app/ocr-to-any-converter-cmd/try-and-buy.html

>>6>Please suggest the best tool to OCR handwritten images or PDFs.
>>To what extent (percentage) will these tools support handwritten scanned images or PDF files?

Thanks for your message. Our OCR engines support only machine-printed characters; they do not handle handwritten characters well.

>>We also have a few queries regarding performance on the server.
>>* Response Time: 2 seconds - can it support this?
>>* Total Number of Users: 1400 users – can it support this?
>>* Concurrent users: 100 users – can it support this?

Yes, our "VeryPDF OCR to Any Converter Command Line" software can meet the above performance requirements. You can download the trial version from our website and test it yourself. If you encounter any problems, please feel free to let us know.
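One way to check these figures on your own server is to push a batch of conversions through a bounded worker pool and time each one. The sketch below is only an illustration in Python, not a VeryPDF tool: run_batch and its convert callback are hypothetical names, and the callback is assumed to wrap whatever conversion command line you want to measure (for example via subprocess.run).

```python
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, Iterable, Tuple


def run_batch(files: Iterable[str], convert: Callable[[str], None],
              max_workers: int = 4) -> Dict[str, float]:
    """Call convert(path) for every file, with at most max_workers running
    at the same time, and return the elapsed seconds per file."""

    def timed(path: str) -> Tuple[str, float]:
        start = time.perf_counter()
        convert(path)
        return path, time.perf_counter() - start

    # The pool size caps how many conversions run concurrently.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return dict(pool.map(timed, files))
```

Comparing each elapsed time against the 2 second target, while raising max_workers toward the expected number of concurrent users, gives a rough picture of how the server behaves under load.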

>>What kind of server configuration (Linux OS, HDD and RAM) will it need to install the Linux version of this tool and support the above criteria?
>>Please suggest.

System requirements for the Linux version:
1. The default Linux OS is CentOS 5 or 6. If you need a version for another Linux OS, please let us know.
2. There is no fixed HDD requirement, but 2 GB or more of free space is recommended.
3. There is no fixed RAM requirement, but 2 GB or more is recommended, because more RAM gives higher conversion speed.

VeryPDF
