I am evaluating ps2txt to see if we can use it to convert our PostScript
files to text files. It works when converting the test PostScript file
provided with the download. But when I try it on one of our PostScript
files, no text is extracted. Only a couple of unprintable characters are in
the output. Is there an issue with PostScript encoding that prevents the
text from being extracted?
Could you please email us your sample PS file and your Order ID (if you have one)?
After we have checked your sample PS file, we will work out a solution for you shortly.
Thanks. Attached is the test file I am trying to retrieve the text from. I do not have an Order ID yet. If I am able to successfully get the text from this file, I will place an order.
We have checked your “p_42947449.ps” file. This PS file contains embedded fonts, and the characters in it are garbage rather than readable text. The file contains many text entries like the following:
2913 6510 3F ,
2946 6507 66 69 /1F
3020 6510 64 66 /4F
3134 6510 71 87 /1J
This is why our PS2TXT application cannot convert this PS file to a text file. However, you can convert it to a text file with the following steps:
1. Use our PS to PDF Converter to convert the PS file to a PDF file first:
ps2pdf.exe D:\temp4\p_42947449.ps D:\temp4\p_42947449.pdf
2. Use our “PDF to Text Converter Command Line” to convert the PDF file to a text file:
pdf2txtocr.exe -ocr D:\temp4\p_42947449.pdf D:\temp4\p_42947449.txt
3. You will then get a text file that contains all of the readable text. Please refer to the converted PDF and text files in the attachment.
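If you need to repeat these steps for many PS files, the two commands above could be wrapped in a small script. The following is only a rough sketch, not part of our products: it assumes Python is installed, that ps2pdf.exe and pdf2txtocr.exe can be found on the PATH, and that the output files should sit next to the input file. The function names build_commands and convert are made up for this illustration.

```python
import subprocess
from pathlib import Path


def build_commands(ps_path, ps2pdf="ps2pdf.exe", pdf2txt="pdf2txtocr.exe"):
    """Build the two command lines from steps 1 and 2 for one PS file.

    The executable names default to the ones used in the steps above;
    pass full paths instead if they are not on the PATH (assumption).
    """
    ps = Path(ps_path)
    pdf = ps.with_suffix(".pdf")  # same folder and name, .pdf extension
    txt = ps.with_suffix(".txt")  # same folder and name, .txt extension
    return [
        [ps2pdf, str(ps), str(pdf)],           # step 1: PS -> PDF
        [pdf2txt, "-ocr", str(pdf), str(txt)],  # step 2: PDF -> text with -ocr
    ]


def convert(ps_path):
    """Run both steps in order and return the path of the text file."""
    for cmd in build_commands(ps_path):
        # check=True raises CalledProcessError if a tool exits with an error
        subprocess.run(cmd, check=True)
    return Path(ps_path).with_suffix(".txt")
```

For example, calling convert(r"D:\temp4\p_42947449.ps") would run the same two commands shown in steps 1 and 2. Passing each command as a list avoids quoting problems when a path contains spaces.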