Convert text based PDF to plain text files by OCR to Any Converter CMD

VeryPDF OCR to Any Converter Command Line was released recently. By this software, you can convert scanned PDF, TIFF and Image files to editable Word, Excel, CSV, HTML, TXT, Pure Text Layer PDF, Invisible Text Layer PDF, etc. formats. In this article, I will show you how to convert text based PDF file to plain text. Please note the text based PDF refer to the ones which can be copied and pasted fonts in it. By this method, you can not handle image based PDF files.

Step 1. Download OCR to Any Converter

  • There are server version and developer version available on our website. By the server version, you can also use it on single computer or laptop.
  • When downloading finishes, please unzip the zip file then call it from MS Dos Windows. Also you can call it together with C#, VB.NET, ASP.NET, VB, VC, Delphi, ASP, PHP, Javascript, VB Script, etc.

Step 2. Convert text based PDF to plain text

  • When you use this software, please refer the usage and examples. And simply run the bat file then you can check the conversion effect.
  • Usage:       ocr2any.exe [options] <PDF-file> <Text-file>
  • When you need to convert test based PDF to plain text, please refer to the following command line templates.

    ocr2any.exe C:\in.pdf C:\out.txt
    By this command line, you can convert single PDF file to text.
    ocr2any.exe -firstpage 1 -lastpage 1 C:\in.pdf C:\out.txt
    By this command line, you can convert first page of PDF file to text. Also by this method, you convert PDF to text and specify conversion page range. By those two parameters, the conversion efficiency can be enhanced greatly.  
    ocr2any.exe -ownerpwd 123 -userpwd 456 C:\in.pdf C:\out.txt
    By this command line template, you can convert password protested PDF to text. If you do not know the owner password, simply input one.
    ocr2any.exe -layout C:\in.pdf C:\out.txt
    When you need to convert PDF to text and maintain layout in original PDF file, please add this parameter.
    ocr2any.exe -noc C:\in.pdf C:\out.txt
    If you need to convert all pages of PDF content into one text file, please refer to this command line.

Now let us check related parameters.

-firstpage <int>:please use this parameter to specify the first PDF page to convert
-lastpage <int>: specify the last PDF page to convert
-res <int>: when you need to set resolution, the unit is DPI (default is 300 dpi), please add this parameter.
-ownerpwd <string>: please input owner password for encrypted PDF file when do converting
-userpwd <string>: please input user password when processing encrypted PDF file
-layout: when you need to maintain original physical layout of PDF in text file, please add this parameter

Now let us check the conversion effect from the following snapshot.

input PDF file and output text

We will publish more articles in our knowledge base about this software,if you are interested in this software, please pay more attention to our website. During the using, if you have any question, please contact us as soon as possible.

 

VN:F [1.9.20_1166]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.20_1166]
Rating: 0 (from 0 votes)

Related Posts

One Reply to “Convert text based PDF to plain text files by OCR to Any Converter CMD”

  1. Thanks for your message, the following products are all can convert scanned PDF files and TIFF files to searchable PDF files, the output PDF files will contain a hidden text layer, you can open OCRed PDF files in Adobe Reader and search text contents properly,

    Image to PDF OCR Converter Command Line,
    http://www.verypdf.com/app/image-to-pdf-ocr-converter/try-and-buy.html#buy-ocr-cmd

    PDF to Text OCR Converter Command Line,
    http://www.verypdf.com/app/pdf-to-text-ocr-converter/try-and-buy.html#buy

    VeryPDF OCR to Any Converter Command Line,
    http://www.verypdf.com/app/ocr-to-any-converter-cmd/try-and-buy.html

    Please look at following web pages for more information,

    http://www.verypdf.com/wordpress/201211/convert-scanned-pdf-to-searchable-pdf-without-losing-color-32937.html

    http://www.verypdf.com/wordpress/201312/bulk-scanned-pdf-files-to-searchable-pdf-files-batch-converter-40025.html

    http://www.verypdf.com/wordpress/201211/convert-image-and-scanned-pdf-to-searchable-pdf-32896.html

    VN:F [1.9.20_1166]
    Rating: 0.0/5 (0 votes cast)
    VN:F [1.9.20_1166]
    Rating: 0 (from 0 votes)

Leave a Reply

Your email address will not be published. Required fields are marked *


Verify Code   If you cannot see the CheckCode image,please refresh the page again!