we have recently been looking at your PDF2Text product as need this functionality in our software. It appears that this product has a peculiar way of supporting Unicode. It looks like you need to have a text file in the folder where the DLL lives that defines each Unicode character mapping to use. This might be a deal killer for us. Could you describe this Unicode support model in more detail? Also, do you plan to support Unicode in a more general way in the near future? Thank you.
Yes, our PDF2TXT SDK does support Unicode characters, it does support English, French, German, Italian, Chinese Simplified, Chinese Traditional, Czech, Danish, Dutch, Japanese, Korean, Norwegian, Polish, Portuguese, Russian, Spanish, Swedish, Thai, etc. languages.
You can call PDF2TXTEx() function in PDF2TXT SDK product, PDF2TXTEx() function does support Unicode, please refer to following VC++ source code,
void main(int argc,char *argv)
if(argc != 3)
printf("Usage: input.pdf output.txt");
//Register your PDF2TXT SDK by given License Code
SetPageSeparator("\r\n\r\n\r\nTest for PageSeparator %PageNumber% of %PageCount%\r\n\r\n\r\n");
int iRet = PDF2TXTEx(argv, argv, 0, 0, NULL, NULL);