Extracting information from a PDF

Question:I am doing a university project and it based on "Extracting Information from PDF". My idea is to:

  • search and find out correct PDF using the text which I input. eg - there are lots of jumbled PDF in hard disk and I want to select pdf regarding "Artificial Intelligent"
  • My searching query is also "Artificial Intelligent" I also need to extract the content of Artificial intelligent content inside in the PDF the content relevant to my input query will display in the interface finally. Is there any solution on VeryPDF to solve this problem I am kindly looking forward student.

Answer: According to your needs, maybe you can have a free trial of this software: VeryPDF Advanced PDF Tools Command Line, by which you can extract data from PDF in batch. Say you can input the whole folder PDF file, and then all the PDF basic information will be shown in the MS Dos Windows. Now you can pick up those with Artificial Intelligent". By this method, you do not need to check PDF file one by one, you can pick up PDF related to your subject. Please check more information of this software on homepage, in the following part, let us check how to use this software.

Step 1. Free download Advanced PDF Tools Command Line

  • As this is command line version software, when downloading finishes, there will be a zip file. Please extract it to some folder then you can check help document and call it from MS Dos Windows.
  • When you use this software, please refer to the usage and examples.

Step 2. Extracting information from PDF.

  • Here is the usage for your reference:pdftools [options] { [-i ] "input-file" } "output-file"
  • When you need to extract data from PDF, please refer to the following command line template:
  • Show PDF file information
    -r: option -r is to show the detail information of selected PDF files, including file name, PDF version, security, file page count, title, author, etc.
    pdftools -i "C:\input.pdf" –r    
    where the "C:\input.pdf" is the file's path name.
    When you need to extract data in batch of the whole folder, please use it like this:
    pdftools -i "C:\*.pdf" –r
    The following snapshot is from the MS Dos Windows, please have a check.

    MS Dos Window showing information

  • All the PDF will be shown like this in MS Dos Windows. Now you can pick up the useful ones according to those basic information.

There are lots of functions of this software, I can not list them here. Please check more on homepage, during the using, if you have any question, please contact us as soon as possible.

VN:F [1.9.20_1166]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.20_1166]
Rating: 0 (from 0 votes)

Related Posts

Leave a Reply

Your email address will not be published. Required fields are marked *


Verify Code   If you cannot see the CheckCode image,please refresh the page again!