How to convert PDF to Word or Excel automatically with OCR to Any Converter?

Question: VeryPDF OCR to Any Converter - how do the watch / output folder options work? Can I set it to auto convert PDF docs to Excel or Word? I set up the watch folder as the manual said. It ignores the PDF files I drop into that folder. What are the recognized image types? Also, can I set the output format or is it automatic?

Answer: To use the watch / output folder options for converting PDF to Word or PDF to Excel automatically, please use the command line version of VeryPDF OCR to Any Converter. The GUI version can only monitor folders and process image files automatically: it converts image files such as TIFF, PNG and JPG, and once you add image files to a monitored folder, they are converted to text files, Word files or other formats at once. It cannot process PDF files yet; we will fix this bug in a new edition. For now, please write a .bat file to convert PDF to Word or PDF to Excel automatically. You can find more information about this software on its homepage; the steps below show how to set it up.

Step 1. Download OCR to Any Converter Command Line (free download).

  • This is the command line version of the software. The download is a zip file; extract it to a folder, and then you can call it from an MS-DOS (Command Prompt) window.
  • When you use this software, please read the usage notes in the readme.txt file carefully; it contains many examples.

Step 2. Write a .bat file to convert PDF to Word or PDF to Excel automatically.

  • Here is the usage: ocr2any.exe [options] <PDF-file> <Text-file>
  • When writing a .bat file, please refer to the following command lines:

    Convert all TIFF files to text files, from C:\input\*.tif to D:\output\*.txt:
    ------------------------------------------------
    REM @ECHO OFF
    :retry
    for %%F in (C:\input\*.tif) do (
    E:\ocr2any\ocr2any.exe "%%F" "D:\output\%%~nF.txt"
    move "%%F" "%%F.bak"
    )
    ping -n 5 127.0.0.1 > nul
    goto retry

    ------------------------------------------------

    Convert all TIFF files to Word documents, from C:\input\*.tif to D:\output\*.doc:
    ------------------------------------------------
    REM @ECHO OFF
    :retry
    for %%F in (C:\input\*.tif) do (
    E:\ocr2any\ocr2any.exe -ocr2 "%%F" "D:\output\%%~nF.doc"
    move "%%F" "%%F.bak"
    )
    ping -n 5 127.0.0.1 > nul
    goto retry

    ------------------------------------------------

    Convert all TIFF files to Excel spreadsheets, from C:\input\*.tif to D:\output\*.xls:
    ------------------------------------------------
    REM @ECHO OFF
    :retry
    for %%F in (C:\input\*.tif) do (
    E:\ocr2any\ocr2any.exe -ocr2 "%%F" "D:\output\%%~nF.xls"
    move "%%F" "%%F.bak"
    )
    ping -n 5 127.0.0.1 > nul
    goto retry

    ------------------------------------------------

  • The following command line, typed at the command prompt, will OCR all PDF files in the D:\temp\ folder to text files (inside a .bat file, write %%F instead of %F):

      for %F in (D:\temp\*.pdf) do ocr2any.exe -ocr -lang deu "%F" "%~dpnF.txt"

    The following command line will OCR all PDF files in the D:\temp\ folder and its subdirectories to text files:

      for /r D:\temp %F in (*.pdf) do ocr2any.exe -ocr "%F" "%~dpnF.txt"

    The following command line will OCR all PDF files in the D:\temp\ folder and output the text files to the C:\test folder:

      for %F in (D:\temp\*.pdf) do ocr2any.exe -ocr "%F" "C:\test\%~nF.txt"

    The following command lines use the Enhanced OCR options:

      for %F in (D:\temp\*.pdf) do ocr2any.exe -ocr2 -lang deu "%F" "%~dpnF.txt"
      for %F in (D:\temp\*.pdf) do ocr2any.exe -ocr2 -lang eng "%F" "%~dpnF.doc"
      for %F in (D:\temp\*.tif) do ocr2any.exe -ocr2 "%F" "%~dpnF.doc"
      for %F in (D:\temp\*.tif) do ocr2any.exe -ocr2 -ocr2autorotate "%F" "%~dpnF.xls"
      for /r D:\temp %F in (*.pdf) do ocr2any.exe -ocr2 "%F" "%~dpnF.rtf"
      for %F in (D:\temp\*.pdf) do ocr2any.exe -ocr2 "%F" "C:\test\%~nF.html"
      ocr2any.exe -ocr2 D:\temp\*.tif D:\temp\*.html
      ocr2any.exe -ocr2 -ocr2excelmode 0 D:\temp\*.pdf D:\temp\*.xls
      ocr2any.exe -ocr2 D:\temp\*.png D:\temp\*.rtf
      ocr2any.exe -ocr2 D:\temp\*.tif D:\temp\*.csv
      ocr2any.exe -ocr2 D:\temp\*.pdf D:\temp\*.doc

  • To convert PDF to Excel, simply change the output file extension in the command line templates above. With this method, the files in the specified folder will be converted to Word or Excel automatically.

  • The above command lines are for converting text-based PDF files to Word or Excel. To convert image-based PDF files to Word or Excel, please add OCR parameters to the command line template. Here are some parameters for your reference:


    -ocrmode <int> : set OCR mode
      -ocrmode 0: output to text file
      -ocrmode 1: OCR PDF pages and insert new text layer under original PDF pages
      -ocrmode 2: output to plain text based PDF file
      -ocrmode 3: output to OCRed PDF file (BW) with hidden text layer
      -ocrmode 4: output to OCRed PDF file (Color) with hidden text layer
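
The three .bat templates above all watch for TIFF files. As a minimal sketch of the same polling loop for PDF input, the following assumes that -ocr2 handles PDF files in a loop the same way it does in the one-line examples above. The install path, the folder names and the variable names (OCR2ANY, INPUT, OUTPUT, EXT) are placeholders chosen for illustration, not values taken from the readme.txt.

```batch
@ECHO OFF
REM Placeholder locations; adjust them to your own folders.
set "OCR2ANY=E:\ocr2any\ocr2any.exe"
set "INPUT=C:\input"
set "OUTPUT=D:\output"
REM Use doc for Word output or xls for Excel output.
set "EXT=doc"

:retry
for %%F in ("%INPUT%\*.pdf") do (
    "%OCR2ANY%" -ocr2 "%%F" "%OUTPUT%\%%~nF.%EXT%"
    REM Rename the source so the next pass skips it.
    move "%%F" "%%F.bak" > nul
)
REM Pause for a few seconds, then scan the folder again.
ping -n 5 127.0.0.1 > nul
goto retry
```

Like the TIFF templates, this sketch renames every source file to .bak whether or not the conversion succeeded, so it may be worth checking the output folder before deleting the .bak files. Press Ctrl+C in the command window to stop the loop.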

We will try to make the GUI version OCR PDF files to Word or Excel automatically as soon as possible. For now, please use the command line version. If you have any questions while using it, please contact us.
