Create full-text searchable PDF file from image PDF

   If you are frequently PDF user, maybe you meet some PDF which can not allow you to do copy and paste, let alone do searching in that kind of PDF file. We call that kind of PDF as image PDF. In the following part, I will show you how to create full-text searchable PDF from image PDF. I use software VeryPDF Image to PDF OCR Converter Command Line, which can be used to convert image and PDF to searchable PDF and text.

Step 1. Download Image to PDF OCR Converter Command Line

  • There are many versions of this software stated on our website, please make sure download the right version. When downloading finishes, please extract it to some folder. Please call img2pdfnew.exe in MS Dos Windows.
  • Now there is only server version and developer version available on our website. But the server version can also be called from computer or laptop.

Step 2. Convert PDF to searchable PDF

  • When you run the conversion, please refer to the usage and examples.
  • Usage: img2pdf [options] <Image-file> [<PDF-file>]
    When you need to convert PDF to searchable PDF file, please refer to the following command line templates.
  • img2pdfnew.exe -ocr 1 -tsocr C:\in.pdf C:\out.pdf
    img2pdfnew.exe -ocr 1 -tsocr -plaintextpdf C:\in.pdf C:\out.pdf
    img2pdfnew.exe -ocr 1 -combineword 1 -bitcount 1 C:\in.pdf C:\out.pdf
    img2pdfnew.exe -ocr 1 C:\in.pdf C:\out.pdf
    Now let us check related parameters:
    -ocr <int>: when you need to create full-text searchable PDF file, please add this parameter.
    -tsocr : when you need to use tesseract-ocr engine, please add this parameter.
    -combineword <int>:this parameter can help you combine OCRed characters to words.
    -plaintextpdf : when you need to convert scanned image files or PDF files to pure text based PDF files, please add this parameter.

    Now let us check the conversion effect from the following snapshot.

    image PDF and output PDF 

    In the image PDF file, when you try to copy content, the bounced dialogue box reminding you to Copy Image. But in the text based PDF file, when you need to copy content, the bounced dialogue box reminding to you to copy. By the method, you can check whether the conversion is successfully or not.

If you do not add parameter –OCR, this software will convert image to normal image PDF. So with this one software, you can either convert image to PDF, image to searchable PDF  or convert image to text. All the VeryPDF software are free trial and free downloading. During the using, if you have any question, please contact us as soon as possible.

VN:F [1.9.20_1166]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.20_1166]
Rating: 0 (from 0 votes)

Related Posts

Leave a Reply

Your email address will not be published. Required fields are marked *


Verify Code   If you cannot see the CheckCode image,please refresh the page again!