How can I extract meta info from input PDF file to the given output file?

What is meta info?
PDF meta info is the information such as title, author, creator, subject, keywords, and producer. If you have Adobe products like Adobe Acrobat and Adobe Reader, you can use any of them to find the meta info by opening the properties dialog box and then view and modify the PDF meta info.

Here is how:
To extract meta information, you can use VeryPDF PDF Toolboxwhich does not require Adobe product or any other third party application. The ASP.NET, C#, and VB.NET samples below show how to extract meta information from PDF files to a given output file.

C# code:
using System;
using System.Collections.Generic;
using System.Linq;
using System.Text;
namespace Call_PDFToolBoxCom
{
    class Program
    {
        static void Main(string[] args)

        {
            string strExePath = System.AppDomain.CurrentDomain.SetupInformation.ApplicationBase;
            PDFToolBoxCom.PDFToolBoxCom p = new PDFToolBoxCom.PDFToolBoxCom();
            string strReturn = p.RunCMD(strExePath + @"\in.pdf -getinfo -outfile "+ strExePath +@"\report.txt");
            Console.WriteLine(strReturn);
            Console.WriteLine("Please enter any key to continue...");
            Console.ReadKey();
        }
    }
}

VB_NET code:
Module Module1
    Sub Main()
        Dim p As PDFToolBoxCom.PDFToolBoxCom
        Dim strRet As String
        Dim strParam As String
        Dim strPath As String
        strPath = AppDomain.CurrentDomain.SetupInformation.ApplicationBase
        strParam = strPath + "pdftoolbox.exe " + strPath + "in.pdf" + " -getinfo -outfile" + strPath + "report.txt"
        Console.WriteLine(strParam)
        p = New PDFToolBoxCom.PDFToolBoxCom()
        strRet = p.RunCMD(strParam)
        Console.WriteLine(strRet)
        Console.WriteLine("Please enter any key to continue...")
        Console.ReadKey()
    End Sub
End
Module

ASP_NET code:
using System;
using System.Collections.Generic;
using System.Linq;
using System.Web;
using System.Web.UI;
using System.Web.UI.WebControls;
namespace ASP_NET_Call_PDFToolBoxCom
{
    public partial class _Default : System.Web.UI.Page       
    {
        protected void Page_Load(object sender, EventArgs e)
        {}
        protected void Button1_Click(object sender, EventArgs e)
        {
            Console.WriteLine("UserName: {0}", Environment.UserName);
            PDFToolBoxCom.PDFToolBoxCom p = new PDFToolBoxCom.PDFToolBoxCom();
            string strPath = @"E:\bin\pdftoolbox.exe";
            strPath += @" in.pdf -getinfo -outfile report.txt";
            string strRet = p.RunCMD(strPath);
            Response.Write(strRet);
        }
    }
}

 

______________________
Lear more: VeryPDF PDF Toolbox Component for .NET.

VN:F [1.9.20_1166]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.20_1166]
Rating: 0 (from 0 votes)

Related Posts

This entry was posted in VeryPDF Blog and tagged , , , , , , , . Bookmark the permalink.

Leave a Reply

Your email address will not be published. Required fields are marked *


Verify Code   If you cannot see the CheckCode image,please refresh the page again!