Hi, I am evaluating the PDF to Text OCR SDK for .NET. Can you help answer the following questions?
1) The SDK is at v2.0 and the command line version is at v2.5 with some new features. Will the SDK have the newer functions?
2) Is there documentation for the .NET SDK, and/or are there additional API calls?
3) Does the SDK support multi-threading so we can concurrently process more than one file?
4) Besides the API calls, are there any functional or performance differences from the command line implementation?
5) I found some PDF parsing (not OCR) issues with missing white space between words. Is that a common issue?
I have been happy with the performance of the command line version and hope to be able to integrate the functionality into our product. Thank you.
>>1) the SDK is a v2.0 and the command line version is at v2.5 with some new features. Will the SDK have the newer functions?
Yes, PDF to Text OCR Converter SDK for .NET is at v2.0 and the Command Line version is at v2.5, so the Command Line version is newer than the SDK.
But please don't worry: after you buy PDF to Text OCR Converter SDK for .NET, our engineer will send you PDF to Text OCR Converter SDK for .NET v2.5 free of charge, so you will be able to use the latest version without any problem.
>>2) is there documentation for the .net sdk and/or are there additional API calls.
PDF to Text OCR Converter SDK for .NET supports all options that are available in the PDF to Text OCR Converter Command Line software. Please look at the attached readme.txt file and the following web page,
If you encounter any problem with the PDF to Text OCR Converter SDK for .NET package, please feel free to let us know and our engineer will assist you as soon as possible.
>>3) does the SDK support multi-threading so we can concurrently process more than one file
Yes, PDF to Text OCR Converter SDK for .NET supports multi-threading, so you can convert multiple files at the same time.
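For anyone planning the concurrent setup, here is a minimal sketch of the general pattern: a fixed-size thread pool that converts several files at once and collects per-file results or errors. It is written in Python purely for illustration. The names `convert_all` and `convert_one` are hypothetical; `convert_one` stands in for whatever call you actually use (an SDK method or a command line invocation), which this thread does not show.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def convert_all(pdf_paths, convert_one, max_workers=4):
    """Run convert_one(path) for every path on a thread pool.

    convert_one is a hypothetical placeholder for the real conversion
    call. Returns a dict mapping each path to either the value that
    convert_one returned or the exception it raised, so one bad file
    does not stop the rest of the batch.
    """
    results = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Remember which future belongs to which input file.
        futures = {pool.submit(convert_one, path): path for path in pdf_paths}
        for future in as_completed(futures):
            path = futures[future]
            try:
                results[path] = future.result()
            except Exception as exc:
                results[path] = exc
    return results
```

Keeping failures in the result map, rather than letting the first exception propagate, makes it easier to report exactly which files need attention.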
>>4) besides the API calls, is there any functional or performance differences with the command line implementation
"PDF to Text OCR Converter SDK for .NET" and "PDF to Text OCR Converter Command Line" are based on the same source code, so there is no difference in their performance. The only difference is how they are invoked: an API call versus a command line call.
>>5) I found some pdf parsing (not OCR) issues with missing white spaces between words - is that a common issue?
This is not a common issue. Please send us the PDF file and tell us where the problem occurs, and our engineer will check it and get back to you as soon as possible.
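If it helps to pinpoint where the spaces go missing before sending the file, one rough heuristic is to scan the extracted text for unusually long runs of letters, which often indicate words that were joined together. The sketch below is only an illustration in Python; the function name `find_suspect_tokens` and the 25-letter threshold are assumptions, not part of the product.

```python
import re


def find_suspect_tokens(text, max_len=25):
    """Return (line_number, token) pairs for tokens that look like
    several words run together, judged only by letter count.

    The threshold is an arbitrary assumption; tune it for your
    documents. Tokens containing '://' are skipped as likely URLs.
    """
    suspects = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for token in line.split():
            if "://" in token:
                continue
            # Count letters only, ignoring digits and punctuation.
            letters = re.sub(r"[^A-Za-z]", "", token)
            if len(letters) > max_len:
                suspects.append((line_no, token))
    return suspects
```

The line numbers it reports refer to the extracted text file, which gives a concrete place to point to when describing the problem.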
VeryPDF .NET PDF to Text Converter SDK is a mature, stand-alone .NET library component for PDF text extraction and PDF to text conversion. It does not need other .NET PDF library components, Adobe PDF reader, or Acrobat SDKs. By integrating this .NET PDF converter and text extractor library into your .NET projects, such as C# and VB.NET Windows Forms, ASP.NET web, and Console applications, you can easily extract text from a PDF document (or from individual pages) or convert an entire PDF document to a txt file.
Simply add a project reference to the VeryPDF .NET PDF converter and text extractor library DLL, and you can use its .NET APIs and methods for text extraction from PDF and PDF to txt file conversion in your .NET application.
PDF text recognition and extraction are easy to achieve. You can extract text from the whole PDF document, a single page, or a range of PDF pages. The extracted text can be saved in a String object for further use, such as search, archiving, and reuse.
Besides extracting text from individual PDF pages, this .NET PDF converter library component lets you extract the text of all PDF pages directly and save it to a txt file. This conversion process keeps the original text format.
This PDF to text converter & extractor library can be used to extract PDF text and convert PDF to text files in C#, VB.NET Class Library, .NET Windows Forms, ASP.NET web, and .NET Console applications. It is fully compatible with Visual Studio 2005 & above versions, and .NET...
If you are searching for a text extractor or PDF to text converter library for your .NET application development, please try VeryPDF .NET PDF to Text OCR Converter SDK. It has mature and advanced text recognition, extraction, and conversion features, and it is quite simple to integrate and use in your .NET applications. We provide a demo project in the free trial package for your quick reference and evaluation. You may also see the online guide for VeryPDF .NET PDF to Text Converter SDK.
Moreover, we also provide online VB.NET & C# demo code for your reference. Please see "PDF text extraction & PDF to text conversion in C#" and "PDF text extraction & PDF to text conversion in VB.NET".
VeryPDF PDF to Text OCR SDK for .NET
PDF to Text OCR Converter Command Line