I want to buy PDF2TXT COM and use it in my project, but I noticed that the SDK and the test.exe application produce different results.
I attached part of the input file as an image.
First I used test.exe, and the result is attached (exe.txt). The columns in the table are straight (except the header, but for me that’s not a problem).
Next I tried to extract the same file using the SDK:
Pdf2TxtNativeMethods.PDF2TXTSetLicenseCode("XXXXXXXXXXXXXX");
Pdf2TxtNativeMethods.SetTXTFormat(1);
Pdf2TxtNativeMethods.PDF2TXT(fileName, outFile);
The result is in the API.txt file. The last three columns are scattered.
The problem is related to Polish special characters.
Is it possible to change the settings of the SDK component so that it reads the PDF exactly as the console version does?
Can the component read Polish special characters?
=============================
Please call the PDF2TXTEx() function instead of the PDF2TXT() function and try again. The PDF2TXTEx() function does support Polish special characters, for example:
Pdf2TxtNativeMethods.PDF2TXTSetLicenseCode("XXXXXXXXXXXXXX");
Pdf2TxtNativeMethods.SetTXTFormat(1);
Pdf2TxtNativeMethods.PDF2TXTEx(fileName, outFile, 0, 0, 0, 0);
VeryPDF