ocr products

Convert Scanned PDF file to XML format

We are looking for a solution to convert 10 bankers boxes of legal documents. Most of the information, after conversion, is received by an external e-Discovery culling, tagging and production tool like Relativity or Summation.

What would you charge? See the attached requirements… Also we may just want to purchase the software and do it ourselves in-house. What would that charge be?
===============================
Do you wish convert this scanned PDF file to editable Word document? If yes, we suggest you may download mini PDF to Word OCR Converter v3.2 from following web page to try, you can use mini PDF to Word OCR Converter v3.2 to convert your scanned PDF file to editable Word document easily,

http://www.minipdf.com/pdf2wordocr.htm

VeryPDF
===============================

Hello, we need to convert PDF to TIFF CCITT group 4, TIFF to Compressed JPEG, and OCR with text file. Or we would prefer to scan to PDF and save as XML and then run a conversion to all the above formats.

===============================

Thanks for your message, the following products may helpful to you,

VeryDOC PDF to Image Converter, you can use this product to convert PDF file to TIFF CCITT group 4 format,

http://www.verydoc.com/pdf-to-image.html

PDF to Text OCR Converter Command Line, you can use this product to convert scanned PDF file to text file,

https://www.verypdf.com/pdf2txt/pdf-to-text-ocr-converter.htm

however, we can't convert Scanned PDF to XML format directly, we haven't this product yet, but our OCR and PDF Render technology are ready, we can develop a Scanned PDF to XML Converter product to you within a few business days, we will send an email to you to talk the detailed technologies at later.

VeryPDF
VN:F [1.9.20_1166]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.20_1166]
Rating: -1 (from 1 vote)
verypdf blog

emf2vec does add extra white space in output PDF file when converting EMF file

I am attaching two EMF graphics and the PDF outputs generated with the command line emf2vec -rclbounds 208614_x.emf 208614_x.pdf. We are seeing extraneous white-space at the top of the graphic (we do not see this in the originals using several viewers, screen shot from Microsoft Paint also attached ). This is causing issues with our output which no longer fits on one page due to this extra space and will require manual remediation.

Can you please look into this and let us know what the issue is?

emf2vec -v
Thank you for choosing our product.
VeryDOC EMF To Vector Converter v2.0
Web: https://www.verypdf.com
Web: http://www.verydoc.com
Email: support@verypdf.com
Release Date: Apr  9 2009

Thanks again.
============================================
Without that DLL is place we found issues with a couple of graphics. I am attaching the one that I can find. When we run emf2vec -rclbounds on this graphic the resulting pdf is zero bytes and it takes a long time to run. A screen shot is attached along with the input and output. We ran several hundred conversions and only had failure on 3 I believe, but there is definitely something wrong. Having the DLL in place seems to solve the problem, so for now this is not causing us problems.

Thanks again.
============================================

Thanks for your sample files, this is a minor problem, we have solved this problem to you, please download the new version to try again.

VeryPDF
VN:F [1.9.20_1166]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.20_1166]
Rating: 0 (from 0 votes)
html converter (htmltools), html to image converter

Controlling image height in VeryPDF HTML Converter v2.0

Hi support,

I have a question to your VeryPDF HTML Converter v2.0

Your demo always produce an output of 801 x 601px. when creating WMF/JPEG from a HTML file. I am interested to control the width by setting up a fix value and then let your software calculate the height depending on how much text there is included in the HTML data.

What I currently get is this (where the border represent the size of the image):

What I would like is this:

Another example.

Current result:

I need this result:


Is there any way to control this?

Thanks for your time and some very nice tools.
====================================================
Thanks for your message, HTML Converter v2.0 hasn't an option to crop the height of output WMF/JPEG file, however, we have technology to crop the margins from both image (TIFF, TIF, JPEG, JPG, PNG, PCX, etc.) and PDF files, we will add this function into the future releases of HTML to PDF Converter product.

VeryPDF
VN:F [1.9.20_1166]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.20_1166]
Rating: 0 (from 0 votes)
pdf form filler

Fill a PDF form programmatically

Hello, id like to know if i can fill a PDF form, programmably.
I need to run a .NET application on an (internet) server that is able to fill in specific PDF forms provided with a given data.
Is one of your products has the needed API for such a task ?
I'll be happy to know,
=====================================
Yes, you can use our PDF Form Filler SDK to insert XML or FDF or XFDF into PDF files, you can convert your database or data contents to FDF or XML or XFDF format first, then you can use PDF Form Filler SDK to insert this FDF or XML or XFDF into PDF file easily, you can download the trial version of PDF Form Filler SDK from following page to try,

https://www.verypdf.com/pdfform/index.html

VeryPDF
VN:F [1.9.20_1166]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.20_1166]
Rating: 0 (from 0 votes)
pdf editor, pdfstamp command line

Help to remove page stamp

I need to remove the page stamp that we have previously added to a batch of documents using very pdf.

Can this be done with very pdf?
==========================
Thanks for your message, you can use our PDF Editor product to remove the stamps, you can open your PDF file in PDF Editor, click "Edit Content" button, then you can remove the stamps one by one by manual, PDF Editor product can be downloaded from following page,

https://www.verypdf.com/pdf-editor/index.html

Additionally, you can also use PDFStamp Command Line to remove these stamps, PDFStamp Command Line product has an "undo" parameter, you can use PDFStamp Command Line product to undo the stamps from your PDF files on the fly, PDFStamp Command Line product can be downloaded from following page,

https://www.verypdf.com/pdfstamp/index.htm#dl

You can run following command line to undo stamps from your PDF file, for example,

pdfstamp -PDF "test.pdf" -o undo.pdf    -SU

VeryPDF
========================================== 

We have downloaded the command line software but we are still unable to remove the image stamp.

I have attached a sample document that contains the type of image stamp that we are trying to remove.  The image stamp on the sample document was applied using very pdf approximately 12 months ago.

Can you please advise how I can remove the image stamp from a large batch of documents?
=============================================
You can run following command line to remove the stamps from your PDF file,

pdfstamp.exe -pdf D:\temp5\VET.001.001.0016.pdf -o D:\temp5\VET.001.001.0016-undo.pdf -SU

please refer to the new PDF file in attachment, this PDF file hasn't watermark.

VeryPDF
=============================================
Thank you very much for your response.

The sample file is one of 4611 files that we need to remove the watermark from.  Can you please tell me how I can remove the watermark from all files in the batch?
=============================================

You can run following command line to batch remove the stamps from your PDF files,

pdfstamp.exe -pdf D:\in\*.pdf -o D:\out\*.pdf -SU

VeryPDF
=============================================

I've downloaded the trial version 2.5 of pdf stamp command line to try before buying it, but even using the attached command I still cannot remove the stamp. Could you tell me if I am doing anything wrong?

pdfstamp -PDF "VET0010010002.pdf" -o undoVET0010010002.pdf -SU

Running on Windows 7 32 bits
==============================
VET0010010002.pdf file was modified by other applications (Adobe Acrobat or others), because "verypdf" keyword has been removed from this PDF file, so, it is impossible to remove the stamps from this PDF fileby "-su" parameter, please understand this matter.

However, if this function is important to you, we can provide a custom-build version of PDF Text Remover Command Line product to you, you can use this product like following,

pdftxtdel.exe -text "VET.001.001" C:\in\*.pdf C:\out\*.pdf

Above command line will remove all text contents which start with "VET.001.001" string, it can remove any text string from PDF pages. If this solution will acceptable to you, please feel free to let us know, we will talk the details.

VeryPDF

 

 

VN:F [1.9.20_1166]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.20_1166]
Rating: 0 (from 0 votes)