VeryPDF PDF Extract Tool Command Line is a great software which can be used to extract text/word positions from PDF files, you may download the trial version from this web page to try,
https://www.verypdf.com/app/pdf-extract-tool/index.html
https://www.verypdf.com/dl2.php/verypdf_pdf_extract_tool.zip
Here are some questions from VeryPDF's customers,
Hi,
Is there a way to turn off the creation of the XML and TTF font files? These take a long time to be created and I am only interested in the 'TextWithPosition.txt' output file.
Thanks
Customer
------------------------------------
How to Extract Text coordinates from PDF? Can anyone tell me how to get coordinates in pdf document using VB or .NET, suppose if some text is written in pdf document then how can i get coordinates of that text. Its very Urgent.
Customer
------------------------------------
Here I am want to extract the text from PDF file based on coordinates given by me (ie: Rectangle coordinates value) with Bold Italic style by VB.NET.
I want to extract text from PDF file based on Rectangle coordinates value. While extracting text I want to retrain the style applied to that text (ie: Bold, Italic) if possible including color and font name.
Those thing are I want to achieve by using VB.NET.
Customer
------------------------------------
You can run following command line to extract only text and positions, for example,
pdfextract.exe -$ "XXXXXXXXXXXXXXXX" -textpos test-form.pdf _test-form-pos.txt
You can call pdfextract.exe from C#, VB and VB.NET program languages. pdfextract.exe will create a text file with positions/coordinates like below, you can read this text file into your application and analyze it easily,
Thank you.
VeryPDF