In this article, I will show you how to convert Excel reports of PDF to HTML for website building based on raw data and equations. The software I will use named VeryPDF OCR to Any Converter Command Line, which can recognizing tables in PDF or images and save the table to editable general formats. Please check more information of this software on homepage. In the following part, I will show you how to use this software.
- All the VeryPDF software is absolutely clear and safe to free trial, so you can rest assured to use it free many times.
- And if you do not know how to use the version, there is also GUI version available for you.
- When downloading version, there will be zip file in downloading folder. You need to extract it to some folder then you can call it from MS Dos Windows.
Step 2. Convert table PDF to HTML by command line.
- When converting table PDF to HTML, please obey rules of this software and follow examples.
- Usage: ocr2any.exe [options] <PDF-file> <Text-file>
- When converting table PDF to HTML or Excel, this software provides 4 OCR modes. Please check related parameters in the following part.
-ocr2excelmode <int> : set output Excel format when -ocr2 used
-ocr2excelmode 0: One big sheet + All page sheets
-ocr2excelmode 1: All page sheets
-ocr2excelmode 2: One big sheet, default mode
Some examples for your reference:
ocr2any.exe -ocr2 -ocr2excelmode 0 C:\in.pdf C:\out.xls
ocr2any.exe -ocr2 -ocr2excelmode 1 C:\in.pdf C:\out.xls
ocr2any.exe -ocr2 -ocr2excelmode 2 C:\in.pdf C:\out.xls
You can convert table PDF to Excel first then you can calculate some equation and then save Excel to HTML.
- When you need to convert table PDF to HTML directly, please refer to the following command line templates.
ocr2any.exe -ocr2 C:\in.pdf C:\out.html
By this command line, we can convert single PDF to HTML using OCR mode 2.
ocr2any.exe -ocr2 D:\temp\*.pdf D:\temp\*.html
By this command line, we can convert PDF to HTML in batch using wild character.
for %F in (D:\temp\*.pdf) do ocr2any.exe -ocr2 "%F" "C:\test\%~nF.html""
By this command line, you can write bat file for batch conversion automatically.
ocr2any.exe -ocr2 D:\temp\*.tif D:\temp\*.html
This software also can help you convert table Tiff file to HTML file.
-ocr2 : use enhanced OCR module to convert scanned PDF and image files to RTF, DOC, TXT, CSV, Excel, HTML files
-ocr2aor : detect page direction and rotate it automatically when -ocr2 used
Now let us check the conversion effect from the following snapshot.
During the using, if you have any question, please contact us as soon as possible.