I want to evaluate your "Image2PDF OCR Engine" for compression ratio and quality of your OCR engine.
However, I am not able to process my sample Images with the Trail Version.
Could you please help or provide a trail version of your command line tool so that I can complete my evaluation.
==============================
Thanks for your message, you can download PDF OCR Compressor Command Line product from our partner's web page to try,
PDF OCR Compressor Command Line does support JBIG2 and JPEG2000 compression, it can also OCR your PDF and TIFF files to searchable PDF files.
You need a DLL version for PDF OCR Compressor, you can download PDF OCR Compressor Toolkit (SDK) from following web page,
PDF OCR Compressor Toolkit (SDK) is a DLL Library, it is contain all functions in PDF OCR Compressor Command Line product, you can look at all functions in PDF OCR Compressor Toolkit (SDK) from following web page,
==============================
Thanks for your prompt reply. I looked at the "PDF OCR Compressor Toolkit (SDK)".
For my project, I need the following functionalities:
1. OCR
2. Compression
3. PDF/A creation
4. Metadata Insertion
5. Password Protection/Permission
6. Linearization
The following site specifies that all these functionalities can be achieved.
However, SDK Manual does not specify any API to use these functionalities (apart from OCR/Compression),
Is there any other library/Manual which can satisfy my request ?
==============================
You can call Image2PDFOCR_PDFCmd() function to execute these functions,
1. OCR
2. Compression
3. PDF/A creation
4. Metadata Insertion
5. Password Protection/Permission
6. Linearization
Please refer to the statement of Image2PDFOCR_PDFCmd() function at below,
int WINAPI Image2PDFOCR_PDFCmd(char *lpszCmd) Description
Process PDF file by some special commands.
Parameters
lpszCmd
[in] Input Command Line to process the PDF file, it is support following commands,
-mergepdf: merge more PDF files into one PDF file.
Return Values
If the function succeeds, the return value is zero. If the function fails, the return value is one of following values,
-1: Input wrong command line.
-2: Can't load DLL files correctly.
Example
char szMergePDFCmd[1024] = {0};
sprintf(szMergePDFCmd, "-mergepdf C:\\allfiles.txt C:\\out.pdf"); int nRet = Image2PDFOCR_PDFCmd(szMergePDFCmd);
you can use following code to convert a normal PDF file to PDF/A file, compress your PDF files by JBIG2 and JPEG2000, Password Protection/Permission, etc.,
Image2PDFOCR_PDFCmd(“-pdfa C:\in.tif C:\out.pdf”);
Image2PDFOCR_PDFCmd("-mergepdf -pdfa C:\in.pdf C:\out.pdf");
Image2PDFOCR_PDFCmd("-dpi 100 -quality 50 -colorimg 2 -bwimg 2 -skewcorrect -despeckle -chkbwcolor C:\in.tif C:\out.pdf");
Image2PDFOCR_PDFCmd("-keylen 2 -encryption 3900 -ownerpwd 123 C:\in.tif C:\out.pdf");
You can use Image2PDFOCR_PDFCmd() function to execute all functions which included pdfocrjb2sdk.exe command line application.
VeryPDF
======================================
======================================
Thanks for your prompt reply.
Could you please let me know the exhaustive set of command options which can be used with the help of "Image2PDFOCR_PDFCmd".
Please send me the detailed specification of the command line options.
=========================
Please refer to attached readme.txt file, this readme.txt file contains all command line options which can be used by Image2PDFOCR_PDFCmd() function.
C:\>pdfocrjb2sdk.exe
Image to PDF Converter v2.1 (PDF OCR Compressor Toolkit)
Release Date: Apr 27 2011
-------------------------------------------------------
Description:
Usage: img2pdf [options] <Image-file> [<PDF-file>]
-ocr <int> : create full-text searchable PDF file
-ocrtxt <string> : export OCRed text to a text file
-ocrtxtxy <string> : export OCRed text with X, Y coordinate to a text file
-nopdf : do not create PDF file, it is useful when you use '-ocrtxt' parameter
-ocr2 <string> : this option is only available for internal use
-addtextlayerintopdf <string>: this option is only available for internal use
-xres <int> : set X resolution to image file
-yres <int> : set Y resolution to image file
-combineword : combine OCRed characters to words
-ocrrect <string> : set a rectangle region for OCR
-aligntextypos : align OCRed text contents by Y position
-lossless : lossless compression for JBIG2 and JPEG2000
-colorimg <int> : set compression method for color image
-colorimg 0: no change compression for color image file
-colorimg 1: compress color image file with JPEG arithmetic
-colorimg 2: compress color image file with JPEG2000 arithmetic
-bwimg <int> : set compression method for black and white image
-bwimg 0: no change compression for black and white image file
-bwimg 1: compress black and white image file with CCITT G4 arithmetic
-bwimg 2: compress black and white image file with JBIG2 arithmetic
-firstpg <int> : PDF to Image: first page to convert
-lastpg <int> : PDF to Image: last page to convert
-iswh : PDF to Image: specify width/height or DPI
-bitcount <int> : PDF to Image: set color depth
-autobitcount : PDF to Image: determine color depth automatically
-pidpi <int> : PDF to Image: set DPI resolution
-compress <int> : PDF to Image: set compression method to TIFF format
-compress 1 : NONE compression
-compress 2 : CCITT modified Huffman RLE
-compress 3 : CCITT Group 3 fax encoding (1d)
-compress 4 : CCITT Group 4 fax encoding
-compress 5 : LZW compression
-compress 6 : OJPEG compression
-compress 7 : JPEG DCT compression
-compress 32773 : PACKBITS compression
-compress 32809 : THUNDERSCAN compression
-multipage : PDF to Image: create multipage TIFF file
-opw <string> : owner password (for encrypted PDF file)
-upw <string> : user password (for encrypted PDF file)
-quality <int> : set quality for JPEG, JPEG2000 compression
-width <int> : set page width to PDF file
-height <int> : set page height to PDF file
-rotate <int> : rotate pages, 90, 180, 270
-view : view PDF file after creation
-skewcorrect : skew correct for black and white image file
-despeckle : despeckle for black and white image file
-chkbwcolor : check and invert color for black and white image file
-dpi <int> : set DPI resolution to output PDF file
-dpi 0 : use the default image width and height information
-dpi -1 : take DPI info from original image file
-dpi <int> : set resolution to output PDF file
-append <int> :
-append 0: overwrite if PDF file exists
-append 1: insert before first page if PDF file exists
-append 2: append to last page if PDF file exists
-append 3: rename filename if PDF file exists
-producer <string> : set 'producer' to PDF file
-creator <string> : set 'creator' to PDF file
-subject <string> : set 'subject' to PDF file
-title <string> : set 'title' to PDF file
-author <string> : set 'author' to PDF file
-keywords <string> : set 'keywords' to PDF file
-credate <string> : set 'CreationDate' to PDF file
e.g. -credate "20070116230629-08'00'"
-moddate <string> : set 'ModDate' to PDF file
e.g. -moddate "20070116230629-08'00'"
-openpwd <string> : set 'open password' to PDF file
-ownerpwd <string> : set 'owner password' to PDF file
-keylen <int> : key length (40 or 128 bit)
-keylen 0: 40 bit RC4 encryption (Acrobat 3 or higher)
-keylen 1: 128 bit RC4 encryption (Acrobat 5 or higher)
-keylen 2: 128 bit RC4 encryption (Acrobat 6 or higher)
-encryption <int> : restrictions
-encryption 0: encrypt the file only
-encryption 3900: deny anything
-encryption 4: deny printing
-encryption 8: deny modification of contents
-encryption 16: deny copying of contents
-encryption 32: no commenting
===128 bit encryption only -> ignored if 40 bit encryption is used
-encryption 256: deny FillInFormFields
-encryption 512: deny ExtractObj
-encryption 1024: deny Assemble
-encryption 2048: disable high res. printing
-encryption 4096: do not encrypt metadata
-bookmark <int> : create bookmark to PDF file
-bookmark -1: don't create bookmarks
-bookmark -2: read bookmarks from bookmark.ini file
-bookmark -3: use filenames as bookmarks
-bookmark -4: use the filenames as bookmarks, one bookmark at the first page of each tif
-bookmark <num>: "num" must >= 0, specify first number in bookmarks
-mergepdf <string> : merge two PDF files into one PDF file
-mergepdf "file1|file2|file3|...": merge several PDF files into one
-mergepdf "C:\filelist.txt": merge PDF files from a .txt file
-mergepdf "C:\*.pdf": merge PDF files into one PDF file
-burstpdf <string> : burst PDF file into single page PDF files
-pdfx : create PDF/X compliance PDF file
-pdfa : create PDF/A compliance PDF file
-log <string> : redirect output message to a log file
-nocenter : don't center the contents in the PDF file
-noempty : delete empty pages from PDF file
-linearize : optimize PDF file for fast web view
-margin <string> : set page margin to PDF file
-margin 10 : set margin to 10pt to left
-margin 10x10 : set margin to 10pt to left,top
-margin 10x10x10 : set margin to 10pt to left,top,right
-margin 10x10x10x10 : set margin to 10pt to left,top,right,bottom
-wtext <string> : watermark on printed document
-wtype <int> : type of watermark
0 : normal watermark
1 : watermark on header
2 : watermark on footer
-wf <string> : font name of watermark
-wh <int> : font size of watermark
-wb : specify bold font
-wi : specify an italic font
-wu : specify an underlined font
-ws : specify a strikeout font
-wa <int> : angle of watermark
-wbox <string> : a rectangle to output formatted text, it is only useful for "-walign" option, eg:
-wbox "0,0,595,842"
-wbox "0,0,612,792"
-walign <int> : set text align
1 : left
2 : center
3 : right
-wc <string> : color of watermark,
FF0000: Red color
00FF00: Green color
0000FF: Blue color
HexNum: Other colors
-wx <int> : X offset of watermark
-wy <int> : Y offset of watermark
-quick : convert image to PDF without processing
-usegs : use Ghostscript to convert PDF file to image file
-getpagecount : retrieve page count from PDF file
-v : print copyright and version info
-h : print usage information
-help : print usage information
--help : print usage information
-? : print usage information
-$ <string> : input registration key
Example:
pdfocrjb2sdk.exe C:\in.jpg C:\out.pdf
pdfocrjb2sdk.exe C:\in.tif C:\out.pdf
pdfocrjb2sdk.exe -pdfa C:\in.tif C:\out.pdf
pdfocrjb2sdk.exe -pdfx C:\in.tif C:\out.pdf
pdfocrjb2sdk.exe -linearize -pdfa C:\in.tif C:\out.pdf
pdfocrjb2sdk.exe C:\*.tif C:\*.pdf
pdfocrjb2sdk.exe -margin 100x100x100x100 C:\in.tif C:\out.pdf
pdfocrjb2sdk.exe -append 2 -width 612 -height 792 C:\*.tif C:\out.pdf
pdfocrjb2sdk.exe -append 2 C:\*.tif C:\out.pdf
pdfocrjb2sdk.exe C:\filelist.txt C:\out.pdf
pdfocrjb2sdk.exe -width 612 -height 792 C:\in.tif C:\out.pdf
pdfocrjb2sdk.exe -append 1 C:\in.tif C:\out.pdf
pdfocrjb2sdk.exe -append 2 C:\in.tif C:\out.pdf
pdfocrjb2sdk.exe -subject "subject" C:\in.tif C:\out.pdf
pdfocrjb2sdk.exe -ownerpwd 123 -keylen 2 -encryption 3900 C:\in.tif C:\out.pdf
pdfocrjb2sdk.exe -bookmark 3 C:\in.tif C:\out.pdf
pdfocrjb2sdk.exe -mergepdf "C:\1.pdf|C:\2.pdf|C:\3.pdf" C:\out.pdf
pdfocrjb2sdk.exe -mergepdf "C:\*.pdf" C:\out.pdf
pdfocrjb2sdk.exe -mergepdf "C:\filelist.txt" C:\out.pdf
Add watermarks into PDF files:
pdfocrjb2sdk.exe -wtext "ImagePDF" "C:\in.tif" C:\out.pdf
pdfocrjb2sdk.exe -wtext "ImagePDF" -wc "0000FF" "C:\in.tif" C:\out.pdf
pdfocrjb2sdk.exe -wtext "ImagePDF" -wx 100 -wy 100 "C:\in.tif" C:\out.pdf
pdfocrjb2sdk.exe -wtext "ImagePDF" -wtype 1 "C:\in.tif" C:\out.pdf
pdfocrjb2sdk.exe -wtext "ImagePDF" -wtype 2 "C:\in.tif" C:\out.pdf
pdfocrjb2sdk.exe -wtext "ImagePDF" -wtype 0 -wa 45 "C:\in.tif" C:\out.pdf
pdfocrjb2sdk.exe -wtext "ImagePDF" -wf "Arial" "C:\in.tif" C:\out.pdf
pdfocrjb2sdk.exe -wtext "ImagePDF" -wf "Arial" -wh 20 "C:\in.tif" C:\out.pdf
pdfocrjb2sdk.exe -wtext "ImagePDF" -wf "Arial" -wh 20 -wb -wi -wu -ws "C:\in.tif" C:\out.pdf
pdfocrjb2sdk.exe -wtext "ImagePDF" -wf "Arial" -walign 3 -wh 20 -wbox "0,770,612,792" "C:\in.tif" C:\out.pdf
Convert PDF file to image file:
pdfocrjb2sdk.exe C:\in.pdf C:\out.tif
pdfocrjb2sdk.exe -firstpg 10 -lastpg 20 C:\in.pdf C:\out.tif
pdfocrjb2sdk.exe -dpi 300 -bitcount 1 C:\in.pdf C:\out.tif
pdfocrjb2sdk.exe -iswh -width 800 -height 600 C:\in.pdf C:\out.tif
pdfocrjb2sdk.exe -dpi 300 -bitcount 1 -multipage C:\in.pdf C:\out.tif
pdfocrjb2sdk.exe -dpi 300 -bitcount 1 -compress 4 C:\in.pdf C:\out.tif
pdfocrjb2sdk.exe -opw "123" -upw "456" C:\in.pdf C:\out.tif
C:\>
Related Posts
Related posts:
Issue running pdf extractor as scheduled task
Use PDF OCR Compressor Toolkit (SDK) to convert normal PDF files to searchable PDF files
How to extract text of specified pages from image PDF to XLS
How to call Image to PDF OCR SDK from C# source code?
Convert Tiff to searchable PDF and set password to protect PDF
Extract Characters and Words with coordinates and positions from scanned PDF and TIFF files
Could not load file or assembly for ocr2anyCom.exe COM Interface. An attempt was made to load a prog...
Efficient OCR Software Development with VeryPDF OCR SDK - Affordable and Customizable Solution for D...