How to convert Excel reports of PDF to HTML formats for website building?

In this article, I will show you how to convert Excel reports of PDF to HTML for website building based on raw data and equations. The software I will use named VeryPDF OCR to Any Converter Command Line, which can recognizing tables in PDF or  images and save the table to editable general formats. Please check more information of this software on homepage. In the following part, I will show you how to use this software.

Step 1. Download OCR to Any Converter Command Line

  • All the VeryPDF software is absolutely clear and safe to free trial, so you can rest assured to use it free many times.
  • And if you do not know how to use the command line version, there is also GUI version available for you.
  • When downloading command line version, there will be zip file in downloading folder. You need to extract it to some folder then you can call it from MS Dos Windows.

Step 2. Convert table PDF to HTML by command line.

    • When converting table PDF to HTML, please obey rules of this software and follow examples.
    • Usage:  ocr2any.exe [options] <PDF-file> <Text-file>
    • When converting table PDF to HTML or Excel, this software provides 4 OCR modes. Please check related parameters in the following part.

-ocr2excelmode <int>    : set output Excel format when -ocr2 used
-ocr2excelmode 0: One big sheet + All page sheets
-ocr2excelmode 1: All page sheets
-ocr2excelmode 2: One big sheet, default mode
Some examples for your reference:
ocr2any.exe -ocr2 -ocr2excelmode 0 C:\in.pdf C:\out.xls
ocr2any.exe -ocr2 -ocr2excelmode 1 C:\in.pdf C:\out.xls
ocr2any.exe -ocr2 -ocr2excelmode 2 C:\in.pdf C:\out.xls
You can convert table PDF to Excel first then you can calculate some equation and then save Excel to HTML.

    • When you need to convert table PDF to HTML directly, please refer to the following command line templates.

ocr2any.exe -ocr2 C:\in.pdf C:\out.html
By this command line, we can convert single PDF to HTML using OCR mode 2.
ocr2any.exe -ocr2 D:\temp\*.pdf D:\temp\*.html
By this command line, we can convert PDF to HTML in batch using wild character.
for %F in (D:\temp\*.pdf) do ocr2any.exe -ocr2 "%F" "C:\test\%~nF.html""
By this command line, you can write bat file for batch conversion automatically.
ocr2any.exe -ocr2 D:\temp\*.tif D:\temp\*.html
This software also can help you convert table Tiff file to HTML file.
-ocr2         : use enhanced OCR module to convert scanned PDF and image files to RTF, DOC, TXT, CSV, Excel, HTML files
-ocr2aor   : detect page direction and rotate it automatically when -ocr2 used

Now let us check the conversion effect from the following snapshot.

This is origianl PDF file
This is from original PDF file.

output HTML file from table PDF
This is from output HTML file.

During the using, if you have any question, please contact us as soon as possible.

VN:F [1.9.20_1166]
Rating: 10.0/10 (1 vote cast)
VN:F [1.9.20_1166]
Rating: 0 (from 0 votes)
How to convert Excel reports of PDF to HTML formats for website building?, 10.0 out of 10 based on 1 rating

Related Posts

This entry was posted in OCR Products and tagged , . Bookmark the permalink.

Leave a Reply

Your email address will not be published. Required fields are marked *


Verify Code   If you cannot see the CheckCode image,please refresh the page again!