we have recently been looking at your PDF2Text product as need this functionality in our software. It appears that this product has a peculiar way of supporting Unicode. It looks like you need to have a text file in the folder where the DLL lives that defines each Unicode character mapping to use. This might be a deal killer for us. Could you describe this Unicode support model in more detail? Also, do you plan to support Unicode in a more general way in the near future? Thank you.
===========================
Yes, our PDF2TXT SDK does support Unicode characters, it does support English, French, German, Italian, Chinese Simplified, Chinese Traditional, Czech, Danish, Dutch, Japanese, Korean, Norwegian, Polish, Portuguese, Russian, Spanish, Swedish, Thai, etc. languages.
You can call PDF2TXTEx() function in PDF2TXT SDK product, PDF2TXTEx() function does support Unicode, please refer to following VC++ source code,
void main(int argc,char *argv[])
{
if(argc != 3)
{
printf("Usage: input.pdf output.txt");
return;
}
//Register your PDF2TXT SDK by given License Code
PDF2TXTSetLicenseCode("XXXXXXXXXX");
SetPageSeparator("\r\n\r\n\r\nTest for PageSeparator %PageNumber% of %PageCount%\r\n\r\n\r\n");
int iRet = PDF2TXTEx(argv[1], argv[2], 0, 0, NULL, NULL);
}
VeryPDF
===========================
VN:F [1.9.20_1166]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.20_1166]
Related Posts
- Highly accurate OCR server software designed to automate high volume conversion of scanned paper and image documents to searchable PDF
- VeryPDF PDF Extract allows you to extract content from PDF files and save it in a structured data format
- Efficient and Accurate EMF to Text Conversion with VeryPDF Command Line Converter
- Powerful VeryPDF PDF Conversion SDK for Developers: Convert PDF, Word, Excel, PowerPoint, HTML, and More!
- Intelligent PDF Data Extraction with VeryPDF Data Extraction SDK: JSON Output, Table Extraction, and More
- Convert PDF to Text with VeryPDF PDF to Text SDK for Windows, Linux, Mac, iOS, Android platforms
- VeryPDF PDF SDK for Web & Windows & Linux & Mac & iOS & Android as well as PDF Conversion SDK
- VeryPDF Text and Image Extraction Toolkit is a developer product for reliably extracting text, images and metadata from PDF documents
- Full Text Extraction with VeryPDF PDF to Text OCR SDK for .NET
- PDF to Text OCR Converter SDK for .NET, C# OCR SDK, OCR API, OCR Library for .NET Developers Royalty Free
- PDF to Text — Convert PDF to plain text file
- Ask about Turkish caracter set in PDF to Text Converter SDK
- Create PDF from plain text and set PDF font size
- questions about PDF SDK
- How to call PDF to Text Converter SDK/COM from both 32bit and 64bit applications? How to use PDF2TXTCOM.exe COM interface?