I recently purchased a copy of PDF2TXT because I needed to convert PDFs to text while keeping the formatting. I tried it on some documents and it seemed to work OK, but I've just realised that it fails on most of them. There is obviously something unusual about the format of these PDFs, because Adobe also fails to save them as text.
I’ve attached a few sample files for you to have a look at.
We have just double-checked your PDF file. It contains some embedded fonts, and characters rendered with those embedded fonts can't be copied out as text. To confirm this, open the PDF in Adobe Reader, press CTRL+A and then CTRL+C to copy all of the text, and press CTRL+V to paste it into Notepad: you will see that the pasted text is not readable. For the same reason, PDF2TXT can't convert this PDF to a readable text file either. We hope you understand.
Please refer to item No. 4 in the FAQ list.
Additionally, you can download the "PDF to Text OCR Converter Command Line" product from our website and try it. It can convert this type of PDF file to a text file properly, for example:
pdf2txtocr.exe -ocr D:\temp3\xp5.pdf D:\temp3\xp5.txt
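Since most of the files are affected, it may help to run the same command over a whole folder. The following is only a rough Python sketch, not something supplied with the product: it assumes pdf2txtocr.exe is on the PATH, that the "-ocr input output" form shown above is all that is needed, and that the tool returns a non-zero exit code on failure (this has not been confirmed). The function names are made up for illustration.

```python
import subprocess
from pathlib import Path

# Assumption: the converter is reachable on PATH under this name.
OCR_TOOL = "pdf2txtocr.exe"


def build_ocr_command(pdf_path, tool=OCR_TOOL):
    """Return the argument list that converts one PDF to a .txt next to it.

    Mirrors the single example from the reply: <tool> -ocr <input> <output>.
    """
    pdf_path = Path(pdf_path)
    txt_path = pdf_path.with_suffix(".txt")
    return [tool, "-ocr", str(pdf_path), str(txt_path)]


def convert_folder(folder, tool=OCR_TOOL):
    """Run the converter on every .pdf in folder and return the ones that failed.

    Assumption: a non-zero exit code signals a failed conversion.
    """
    failed = []
    for pdf_path in sorted(Path(folder).glob("*.pdf")):
        result = subprocess.run(build_ocr_command(pdf_path, tool))
        if result.returncode != 0:
            failed.append(pdf_path)
    return failed
```

Calling convert_folder(r"D:\temp3") would then attempt every PDF in that folder and return the list of files that may need a closer look.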