I want to buy PDF2TXT COM and use my project, but I noticed different effect of using sdk and test.exe application.
I attached part of input file as an image.
First I used test.exe and result is attached (exe.txt). Columns in the table are straight (except header, but for me it’s not a problem).
Next I tried extract the same file using skd:
Pdf2TxtNativeMethods.PDF2TXTSetLicenseCode("XXXXXXXXXXXXXX");
Pdf2TxtNativeMethods.SetTXTFormat(1);
Pdf2TxtNativeMethods.PDF2TXT(fileName, outFile);
Result is in the API.txt file. Last tree columns are scattered.
The problem is related with Polish special characters.
Is possible to change setting of sdk component to read pdf exactly same like console version do?
Can component read polish special characters?
=============================
Please call PDF2TXTEx() function to instead of PDF2TXT() function to try again, PDF2TXTEx() function does support polish special characters, for example,
Pdf2TxtNativeMethods.PDF2TXTSetLicenseCode("XXXXXXXXXXXXXX");
Pdf2TxtNativeMethods.SetTXTFormat(1);
Pdf2TxtNativeMethods.PDF2TXTEx(fileName, outFile, 0, 0, 0, 0);
Related Posts
Related posts:
Can't call PDF2TXT SDK from ASP code
PDF to Text Converter Fail to convert pdf
Convert PDF to text using advanced OCR technology
How to convert PDF to text when there is some type of encryption?
Command Line Tool for Extracting Text Coordinates in PDF
VeryPDF PDF Parser & Modify Component for .NET Developer License failed to extract characters wh...
Cloud PDF Data Extractor does extract data from PDF invoices and automate your business
Convert PDF to XML and SVG with VeryPDF PDF Extract Tool Command Line for Data Extraction and Automa...