Hello Team,
Can you providing us a script or API in PHP that will read the text of a specific area of a pdf or a image means most highlighted text from a pdf or image.
If yes then what will be the costing.
I m sending example image.
Thank you
Customer
--------------------------------------
Hope you are doing well.
I am looking for a service where:
1. Will send the scanned pdf via API call
2. Your API will extract data of specific area/location
3. Return the data to us via API
Talk to you soon.
Customer
--------------------------------------
VeryPDF OCR Cloud API has this function, please look at following web pages for more information,
https://www.verypdf.com/wordpress/201407/how-to-use-ocr-cloud-api-to-extract-text-positions-or-coordinates-from-tiff-image-files-40806.html
https://www.verypdf.com/wordpress/201308/verypdf-cloud-api-platform-verypdf-ocr-cloud-api-online-ocr-engine-to-recognize-scanned-pdf-and-image-files-to-editable-document-formats-37980.html
https://www.verypdf.com/wordpress/201406/how-to-use-verypdf-cloud-api-ocr-sdk-for-automation-and-getting-coordinate-position-of-text-on-image-40675.html
VeryPDF OCR Cloud API is $19.95 / Month, you can subscribe it from following web page,
https://www.verypdf.com/online/cloud-api/try-and-buy.html
We have implemented "Extract text from image rectangles" function some months ago, we have added a "rectangle" parameter to OCR characters in a rectangle on image, you can use it like below,
http://online.verypdf.com/api/?apikey=XXXXXXXXXXXXXXXX
&app=ocr
&infile=https://dl.dropboxusercontent.com/u/5570462/49AD37032CCC2C0_newfilename10.tif
&format=1
&dumpwordpos=1
&lang=swe
&rectangle=200×1674+822+379
the meaning of “200×1674+822+379” is,
200 is width,
1674 is height,
822 is left position,
379 is top position,
You should better use urlencode() function to encodes string when you call this URL from PHP code, e.g.,
<?php
function get_data($url)
{
$ch = curl_init();
$timeout = 5;
curl_setopt($ch, CURLOPT_URL, $url);
curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1);
curl_setopt($ch, CURLOPT_CONNECTTIMEOUT, $timeout);
$data = curl_exec($ch);
curl_close($ch);
return $data;
}
$strURL = 'http://online.verypdf.com/api/?apikey=XXXXXXXXXXXXXXXX&app=ocr&infile=';
$strURL .= 'https://dl.dropboxusercontent.com/u/5570462/49AD37032CCC2C0_newfilename10.tif';
$strURL .= '&format=1&dumpwordpos=1&lang=swe';
$strURL .= '&rectangle=' . urlencode('200x1674+822+379');
$returned_content = get_data($strURL);
echo $returned_content;
?>
You can use "rectangle" option to get characters from a special rectangle on image file easily.
VeryPDF
Related Posts
- VeryPDF OCR Server – High Accuracy OCR Software for Enterprise Document Conversion, PDF/A Archiving, and Automated Batch Scanning
- VeryPDF Cloud OCR SDK for Document Processing: Extract any data from any document, spreadsheet, and more
- VeryPDF Cloud OCR (WebOCR, Online OCR) REST API converts scanned PDF, TIFF, PNG, JPG files to text and OCRed PDF files using Web REST API
- What is necessary to use this Cloud API on a website that handles mostly pdf files? PDF REST API Reference, WEB SERVICE SOAP and REST Cloud API
- VeryPDF WebAPI is a Best PDF Cloud API Platform for Web and Mobile Developers
- How to use VeryPDF Cloud OCR API to OCR typical Invoice by a template?
- VeryPDF Cloud OCR API does convert scanned PDF files to text files, it can also extract text positions from scanned PDF files
- VeryPDF PDF Form Filler- fill PDF form free online
- How to compress PDF through PHP code?
- How to upload standard PDF form allowing users to fill out directly?
- How to use VeryPDF Online OCR API to Convert PDF to TXT from my .net code? Free WebOCR API and SDK for C#, VB.NET, ASP.NET, Java, C++, VB, etc. developers. Royalty Free Invoice OCR SDK, extract items from scanned TIFF, JPG, PNG and PDF invoice documents.
- I would like to occasionally convert a website to a PDF via your web interface, how do I remove demo watermark from generated PDF files?