I recently purchased a copy of PDF2TXT because I need to convert PDF files to text while keeping the formatting. It seemed to work on the first few documents I tried, but I have just realised that it fails on most of them. There is obviously something unusual about the format of these PDFs, because Adobe also fails to save them as text.
I’ve attached a few sample files for you to have a look at.
================================
We have just double-checked your PDF file. It contains embedded fonts, and the characters rendered with those embedded fonts cannot be copied out as readable text. You can confirm this in Adobe Reader: open the PDF, press CTRL+A and CTRL+C to copy all of the text, then press CTRL+V to paste it into Notepad. You will notice that the pasted text is not readable. For the same reason, our PDF2TXT cannot convert this PDF file to a readable text file either. We hope you understand.
Please refer to item No. 4 in the FAQ list:
https://www.verypdf.com/pdf2txt/support/index.html#4
Additionally, you can download the "PDF to Text OCR Converter Command Line" product from our website and try it. It can convert this type of PDF file to a text file properly:
https://www.verypdf.com/pdf2txt/pdf-to-text-ocr-converter.htm
For example:
pdf2txtocr.exe -ocr D:\temp3\xp5.pdf D:\temp3\xp5.txt
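If you have many files, the check described above can be roughly automated. The following is only a minimal Python sketch, not part of our products: the function names and the 0.3 threshold are hypothetical, and the heuristic only catches text full of control or replacement characters, so garbled output made of ordinary-looking letters will slip through. The only command-line usage it relies on is the `-ocr` form shown in the example above, and it assumes pdf2txtocr.exe is on your PATH.

```python
import subprocess


def looks_unreadable(text: str, threshold: float = 0.3) -> bool:
    """Rough heuristic (an assumption, not a VeryPDF feature): treat text as
    unreadable when it is empty or mostly non-printable/replacement characters."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return True
    odd = sum(1 for c in chars if not c.isprintable() or c == "\ufffd")
    return odd / len(chars) > threshold


def ocr_command(pdf_path: str, txt_path: str, exe: str = "pdf2txtocr.exe") -> list:
    """Build the same command line as the example above: exe -ocr input output."""
    return [exe, "-ocr", str(pdf_path), str(txt_path)]


def convert_with_ocr(pdf_path: str, txt_path: str) -> None:
    """Run the OCR converter; raises CalledProcessError on a non-zero exit code."""
    subprocess.run(ocr_command(pdf_path, txt_path), check=True)
```

A typical use would be to read the text file produced by PDF2TXT, pass its contents to `looks_unreadable`, and call `convert_with_ocr` only for the files that fail the check.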
Hi,
I am looking to convert a PDF file to XML. How can I do that?
I would also like to take the information from the PDF (all the fields, such as company name, product name, etc.) and load it into my database. Is there any solution available?
Thanks,
Mohammed Aejaz
Please download the following products from our website and try them:
http://www.verydoc.com/pdf2xmlsdk.html
http://www.verydoc.com/pdfparsersdk.html
Both of these products can convert PDF files to XML files. We hope they will be useful to you.
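For the database part of the question, a rough idea of the follow-up step is sketched below in Python. This is an illustration only and does not use the SDKs above: it assumes the text has already been extracted from the PDF and that each field sits on its own line as "Label: value", which may not match your documents. The function names, the `record`/`field` element names, and the `pdf_fields` table are all hypothetical.

```python
import sqlite3
import xml.etree.ElementTree as ET


def parse_fields(text: str) -> dict:
    """Collect 'Label: value' lines into a dict (assumed layout, see above)."""
    fields = {}
    for line in text.splitlines():
        label, sep, value = line.partition(":")
        if sep and label.strip() and value.strip():
            fields[label.strip()] = value.strip()
    return fields


def fields_to_xml(fields: dict) -> str:
    """Serialize the fields as a small XML record (hypothetical element names)."""
    root = ET.Element("record")
    for label, value in fields.items():
        item = ET.SubElement(root, "field", name=label)
        item.text = value
    return ET.tostring(root, encoding="unicode")


def store_fields(conn: sqlite3.Connection, fields: dict) -> None:
    """Insert the fields into a hypothetical name/value table."""
    conn.execute("CREATE TABLE IF NOT EXISTS pdf_fields (name TEXT, value TEXT)")
    conn.executemany(
        "INSERT INTO pdf_fields (name, value) VALUES (?, ?)", fields.items()
    )
    conn.commit()
```

SQLite is used here only because it ships with Python; the same name/value rows could be written to whichever database you already use.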