VeryPDF Layout Analysis SDK can be downloaded from following web page,
https://www.verypdf.com/app/papertools/try-and-buy.html
https://www.verypdf.com/dl2.php/papertoolssdk.zip
VeryPDF Layout Analysis SDK allows to analyze the layout of any document using complex algorithms, able to recognize with high accuracy the different kind of areas in the page.
VeryPDF Layout Analysis SDK identifies the following types of areas:
- Text
- Inverted text
- Noise
- Images (pictures or drawings)
- Tables (rows, columns and cells)
- Horizontal and Vertical Lines
After the layout analysis recognition, it's possible to operate a sub-classification defining some rules according to the kind of document to analyze. For example, on a newspaper page, we could recognize a text area as "Title" or "Header" or "Footer".
The following is a C# source code example to execute Layout Analysis to a scanned image file,
string appPath = Path.GetDirectoryName(Application.ExecutablePath);
System.Type VeryPDFType = System.Type.GetTypeFromProgID("VeryPDF.PaperToolsCom");
VeryPDF.PaperToolsCom VeryPDFCom = (VeryPDF.PaperToolsCom)System.Activator.CreateInstance(VeryPDFType);
string appFolder = Path.GetDirectoryName(Application.ExecutablePath);
string strFolder = Directory.GetParent(appFolder).FullName;
string strReturn = "";
int nFileIndex = 0;
VeryPDFCom.EnableDebugLog(true);
string strInFile = strFolder + "\\sample\\test_table_ocr.tif";
string strOutFile = strFolder + "\\sample\\output\\_output_" + nFileIndex.ToString() + ".png";
string strCmd = "-$ XXXXXXXXXXXXXXXXXX -layout \"" + strInFile + "\" \"" + strOutFile + "\"";
strReturn = strReturn + VeryPDFCom.PaperTools(strCmd);
If you encounter any problem with VeryPDF Despeckle SDK, please feel free to let us know.