Convert scanned table PDF to HTML by OCR to Any Converter

    In the market, there are lots of software which can be used to convert PDF to HTML file and most of them are really good. However, there are seldom ones which can handle table PDF to HTML with good conversion effect. In order to solve this problem, VeryPDF developed software VeryPDF OCR to Any Converter Command Line, which can be used to convert table PDF to HTML perfectly. This software take special arithmetic which is good at handling tables in image or PDF accurately.  In the following part, I will show you how to use this software.

Step 1. Download OCR to Any Converter

  • All the VeryPDF software are free downloading and free trial. So you can download this software and free trial it tens of times.
  • When downloading finishes, there will be an zip file. Please extract it to some folder then you can call the executable file in MS Dos Windows.

Step 2. Convert Table PDF to HTML by command line

  • When run the conversion, please refer to the usage and examples.
  • Usage:     ocr2any.exe [options] <PDF-file> <Text-file>
  • When you convert table PDF to HTML file, please refer to the following command line template.
  • ocr2any.exe -ocr2 C:\in.gif C:\out.htm
    By this command line, you can convert table gif file to HTML file.
    ocr2any.exe -ocr2 C:\in.pdf C:\out.html
    By this command line, you can convert table PDF file to HTML.
    ocr2any.exe -ocr2 D:\temp\*.pdf D:\temp\*.html
    By this command line, you can convert PDF to HTML file in batch by wild character. There is another bath conversion mode for your reference.
    for %F in (D:\temp\*.pdf) do ocr2any.exe -ocr2 "%F" "C:\test\%~nF.html""
    ocr2any.exe -ocr2 D:\temp\*.tif D:\temp\*.html
    By this software, you can also convert table tiff file to HTML file. Meanwhile, this software supports more image as input and more file formats as output. Please check details on our website or readme.txt.
    Related Parameters:
    -ocr2            : use enhanced OCR module to convert scanned PDF and image files to RTF, DOC, TXT, CSV, Excel, HTML files
    -ocr2aor     : detect page direction and rotate it automatically when -ocr2 used
    -ocr2autorotate         : same as -ocr2aor
    -ocr2excelmode <int>    : set output Excel format when -ocr2 used
      -ocr2excelmode 0: One big sheet + All page sheets
      -ocr2excelmode 1: All page sheets
      -ocr2excelmode 2: One big sheet, default mode

This software provides more than three conversion modes for you to choose. Please choose the correct mode according to your conversion needs. Now let us check the conversion effect from the following snapshots. The first one is from the input PDF file. The second one is from the output HTML file.

input PDF file
                     Input PDF file

    output HTML file
                   Output HTML file.

The output HTML file is absolutely searchable. By this software, you can convert table PDF to HTML file easily and accurately. During the using, if you have any question, please contact us as soon as possible.

VN:F [1.9.20_1166]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.20_1166]
Rating: 0 (from 0 votes)

Related Posts

One Reply to “Convert scanned table PDF to HTML by OCR to Any Converter”

  1. thankyou for this video, it help so much! except the seoncd time i did it, all my videos i put on my ipad were upside down do you know why this is? and is there anything i can do to fix it?

    VA:F [1.9.20_1166]
    Rating: 0.0/5 (0 votes cast)
    VA:F [1.9.20_1166]
    Rating: 0 (from 0 votes)

Leave a Reply

Your email address will not be published. Required fields are marked *


Verify Code   If you cannot see the CheckCode image,please refresh the page again!