In the market, there are lots of software which can be used to convert PDF to HTML file and most of them are really good. However, there are seldom ones which can handle table PDF to HTML with good conversion effect. In order to solve this problem, VeryPDF developed software VeryPDF OCR to Any Converter Command Line, which can be used to convert table PDF to HTML perfectly. This software take special arithmetic which is good at handling tables in image or PDF accurately. In the following part, I will show you how to use this software.
Step 1. Download OCR to Any Converter
- All the VeryPDF software are free downloading and free trial. So you can download this software and free trial it tens of times.
- When downloading finishes, there will be an zip file. Please extract it to some folder then you can call the executable file in MS Dos Windows.
Step 2. Convert Table PDF to HTML by
- When run the conversion, please refer to the usage and examples.
- Usage: ocr2any.exe [options] <PDF-file> <Text-file>
- When you convert table PDF to HTML file, please refer to the following template.
ocr2any.exe -ocr2 C:\in.gif C:\out.htm
By this command line, you can convert table gif file to HTML file.
ocr2any.exe -ocr2 C:\in.pdf C:\out.html
By this command line, you can convert table PDF file to HTML.
ocr2any.exe -ocr2 D:\temp\*.pdf D:\temp\*.html
By this command line, you can convert PDF to HTML file in batch by wild character. There is another bath conversion mode for your reference.
for %F in (D:\temp\*.pdf) do ocr2any.exe -ocr2 “%F” “C:\test\%~nF.html””
ocr2any.exe -ocr2 D:\temp\*.tif D:\temp\*.html
By this software, you can also convert table tiff file to HTML file. Meanwhile, this software supports more image as input and more file formats as output. Please check details on our website or readme.txt.
–ocr2 : use enhanced OCR module to convert scanned PDF and image files to RTF, DOC, TXT, CSV, Excel, HTML files
–ocr2aor : detect page direction and rotate it automatically when -ocr2 used
–ocr2autorotate : same as -ocr2aor
–ocr2excelmode <int> : set output Excel format when -ocr2 used
–ocr2excelmode 0: One big sheet + All page sheets
–ocr2excelmode 1: All page sheets
–ocr2excelmode 2: One big sheet, default mode
This software provides more than three conversion modes for you to choose. Please choose the correct mode according to your conversion needs. Now let us check the conversion effect from the following snapshots. The first one is from the input PDF file. The second one is from the output HTML file.
The output HTML file is absolutely searchable. By this software, you can convert table PDF to HTML file easily and accurately. During the using, if you have any question, please contact us as soon as possible.