Daily Archives: 2013/08/01

How to extract text from a PDF within a specific rectangular region?

Question: I need to extract text from a PDF within a specific rectangular region. The workflow is as follows. First, the PDF is converted to a JPG image. Then the user draws a selection rectangle on top of the picture. Then … Continue reading
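The excerpt is cut off before it says what happens to the rectangle, so the following is only a minimal sketch of one step such a workflow usually needs: mapping a rectangle drawn on the rendered image back to PDF page coordinates. It assumes the page was rendered at a known DPI with no cropping or rotation, that the image origin is the top-left corner, and that PDF user space uses 72 units per inch with the origin at the bottom-left. The names `Rect` and `image_rect_to_pdf_rect` are hypothetical and not taken from any library.

```python
from dataclasses import dataclass

POINTS_PER_INCH = 72.0  # default PDF user-space units per inch


@dataclass(frozen=True)
class Rect:
    """Selection rectangle in image pixels, origin at the top-left corner."""
    left: float
    top: float
    right: float
    bottom: float


def image_rect_to_pdf_rect(sel, dpi, page_height_pt):
    """Convert a pixel rectangle to (x0, y0, x1, y1) in PDF points.

    Assumes an uncropped, unrotated render of the page at `dpi`.
    The returned rectangle uses a bottom-left origin, so the vertical
    axis is flipped relative to the image.
    """
    if dpi <= 0:
        raise ValueError("dpi must be positive")
    scale = POINTS_PER_INCH / dpi       # pixels -> points
    x0 = sel.left * scale
    x1 = sel.right * scale
    y1 = page_height_pt - sel.top * scale      # top edge in PDF space
    y0 = page_height_pt - sel.bottom * scale   # bottom edge in PDF space
    return (x0, y0, x1, y1)


# Example: a US Letter page (792 pt tall) rendered at 144 DPI.
selection = Rect(left=144, top=288, right=432, bottom=576)
pdf_rect = image_rect_to_pdf_rect(selection, dpi=144, page_height_pt=792.0)
```

The resulting rectangle could then be handed to whatever region-based text extraction the chosen PDF tool provides. Tools that use a top-left origin would not need the vertical flip.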

Posted in Table Extractor OCR

Is there a C++ library to extract text from a PDF file?

Question: Last year, I built a Java application that uses PDFBox to get the raw text from some PDF files, and I now need to port that application to C++. I wanted to know what the best C++ alternative would be to accomplish … Continue reading

Posted in PDF to Text OCR Command Line

How to convert a multipage PDF to one PNG image file?

Sometimes you may need to convert a multipage PDF to a single image, such as a PNG file. However, the PNG format cannot hold multiple pages in one file. … Continue reading

Posted in PDF Stitcher

Converting a multiple-page PDF to a single image via a .NET application

Question: I'm attempting to convert a PDF into a single image using Ghostscript. Only the first page is converted, while my intention is to generate a horrendously tall PNG/JPG image with all the pages concatenated together. Is it possible to concatenate all … Continue reading
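The question is truncated here, so this is not the answer from the post. It is a minimal sketch of the layout arithmetic behind a "tall" concatenated image, assuming each page has already been rendered to its own image and its pixel size is known. The function name `stack_layout` is hypothetical. Rendering the pages and pasting them onto the canvas are left to whatever imaging tool is in use.

```python
def stack_layout(page_sizes):
    """Compute the canvas size and paste offsets for stacking pages vertically.

    page_sizes: list of (width, height) tuples in pixels, in page order.
    Returns (canvas_width, canvas_height, y_offsets), where y_offsets[i]
    is the vertical position at which page i should be pasted.
    """
    if not page_sizes:
        raise ValueError("at least one page is required")

    # The canvas must be as wide as the widest page.
    canvas_width = max(width for width, _ in page_sizes)

    y_offsets = []
    y = 0
    for _, height in page_sizes:
        y_offsets.append(y)  # this page starts where the previous one ended
        y += height

    return canvas_width, y, y_offsets


# Example: two full pages followed by a smaller one.
width, height, offsets = stack_layout([(600, 800), (600, 800), (400, 300)])
```

With these values, a blank canvas of `width` by `height` pixels can be created and each page image pasted at `(0, offsets[i])`. Narrower pages are left-aligned in this sketch; centering them would be an equally reasonable choice.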

Posted in PDF to Image Converter