I need a script or API in PHP that will read the text of a specific area from a scanned PDF or image file

Hello Team,

Can you provide us with a script or API in PHP that reads the text of a specific area of a PDF or an image, i.e. the most highlighted text in a PDF or image?

If yes, what would it cost?

I am sending an example image.

Thank you
Customer
--------------------------------------
Hope you are doing well.

I am looking for a service where:
1. We will send the scanned PDF via an API call
2. Your API will extract the data from a specific area/location
3. Your API will return the data to us

Talk to you soon.
Customer
--------------------------------------
VeryPDF OCR Cloud API has this function. Please see the following web pages for more information:

https://www.verypdf.com/wordpress/201407/how-to-use-ocr-cloud-api-to-extract-text-positions-or-coordinates-from-tiff-image-files-40806.html

https://www.verypdf.com/wordpress/201308/verypdf-cloud-api-platform-verypdf-ocr-cloud-api-online-ocr-engine-to-recognize-scanned-pdf-and-image-files-to-editable-document-formats-37980.html

https://www.verypdf.com/wordpress/201406/how-to-use-verypdf-cloud-api-ocr-sdk-for-automation-and-getting-coordinate-position-of-text-on-image-40675.html

VeryPDF OCR Cloud API costs $19.95 per month. You can subscribe on the following web page:

https://www.verypdf.com/online/cloud-api/try-and-buy.html

We implemented an "Extract text from image rectangles" function some months ago. It adds a "rectangle" parameter that restricts OCR to the characters inside a rectangle on the image. You can use it as follows (the URL is split across lines here for readability):

http://online.verypdf.com/api/?apikey=XXXXXXXXXXXXXXXX
&app=ocr
&infile=https://dl.dropboxusercontent.com/u/5570462/49AD37032CCC2C0_newfilename10.tif
&format=1
&dumpwordpos=1
&lang=swe
&rectangle=200x1674+822+379

The meaning of "200x1674+822+379" is:

200 is the width,
1674 is the height,
822 is the left position,
379 is the top position.
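
As a small sketch of this layout, the helper below assembles the rectangle value from the four numbers. The name build_rectangle() is hypothetical and not part of the VeryPDF API; it only formats the string in the width x height + left + top order described above. Rejecting non-positive sizes and negative offsets is an assumption made for illustration.

```php
<?php

// Hypothetical helper (not part of the VeryPDF API): formats the four
// numbers in the order described above, width x height + left + top.
function build_rectangle(int $width, int $height, int $left, int $top): string
{
    // Assumption: a usable rectangle has a positive size and a
    // non-negative offset from the top-left corner of the image.
    if ($width <= 0 || $height <= 0 || $left < 0 || $top < 0) {
        throw new InvalidArgumentException('Invalid rectangle values.');
    }
    return sprintf('%dx%d+%d+%d', $width, $height, $left, $top);
}

echo build_rectangle(200, 1674, 822, 379), "\n"; // prints 200x1674+822+379
```

The returned string still has to be URL-encoded before it is placed in a query string.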

It is best to use the urlencode() function to encode the rectangle string when you call this URL from PHP code, because an unencoded "+" in a query string is decoded as a space. For example:

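<?php

// Fetch a URL with cURL and return the response body (false on failure).
function get_data($url)
{
    $ch = curl_init();
    $timeout = 5; // seconds to wait while connecting
    curl_setopt($ch, CURLOPT_URL, $url);
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, 1); // return the body instead of printing it
    curl_setopt($ch, CURLOPT_CONNECTTIMEOUT, $timeout);
    $data = curl_exec($ch);
    curl_close($ch);
    return $data;
}

// Build the request URL piece by piece.
$strURL = 'http://online.verypdf.com/api/?apikey=XXXXXXXXXXXXXXXX&app=ocr&infile=';
$strURL .= 'https://dl.dropboxusercontent.com/u/5570462/49AD37032CCC2C0_newfilename10.tif';
$strURL .= '&format=1&dumpwordpos=1&lang=swe';
// urlencode() turns each "+" into %2B so it is not decoded as a space.
$strURL .= '&rectangle=' . urlencode('200x1674+822+379');

$returned_content = get_data($strURL);
if ($returned_content === false) {
    echo 'Request failed';
} else {
    echo $returned_content;
}
?>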

You can use the "rectangle" option to easily get the characters from a specific rectangle on an image file.
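
If several areas of the same image are needed, one possible approach is to build one request URL per rectangle. The sketch below is only an illustration under stated assumptions: build_ocr_url() is a hypothetical helper, the second rectangle value is made up, and sending one request per rectangle is an assumption rather than documented API behavior. The parameter names are the ones used in the example URL above. It relies on PHP's http_build_query(), which URL-encodes every value, including the infile URL (the example above passes that URL unencoded).

```php
<?php

// Hypothetical helper: assembles the OCR request URL from the parameters
// used in the example above. It only builds the string; it sends nothing.
function build_ocr_url(string $apiKey, string $infile, string $rectangle, string $lang = 'swe'): string
{
    $params = [
        'apikey'      => $apiKey,
        'app'         => 'ocr',
        'infile'      => $infile,
        'format'      => 1,
        'dumpwordpos' => 1,
        'lang'        => $lang,
        'rectangle'   => $rectangle,
    ];
    // http_build_query() URL-encodes each value, so every "+" in the
    // rectangle becomes %2B. The separator is passed explicitly as "&".
    return 'http://online.verypdf.com/api/?' . http_build_query($params, '', '&');
}

$infile = 'https://dl.dropboxusercontent.com/u/5570462/49AD37032CCC2C0_newfilename10.tif';
$rectangles = [
    '200x1674+822+379', // from the example above
    '300x100+50+40',    // made-up second area, for illustration only
];

foreach ($rectangles as $rectangle) {
    echo build_ocr_url('XXXXXXXXXXXXXXXX', $infile, $rectangle), "\n";
}
```

Each printed URL can then be passed to a fetch function such as get_data() from the example above.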

VeryPDF
