How to extract pages from several PDF files and merge them into a new PDF file?

Hello,

there seems to be a serious problem with "pdf split merge v3"!? For most of my larger scanned pdf files I get strange "java errors" ("java.io.ioexception: trailer not found" and then a list of crypic hex internal addresses ... each labeled as 'unknown source') and extracting pages from the pdf is terminated (even when trying to extract one single page). I can send a screenshot - but here in this form I have no option to include a jpg file ..

Customer
----------------------------------------
Hello,

meanwhile I also tested your "VeryPDF Document Converter" Command Line tool. With it it works to extract pages from my larger scanned pdf files (unfortunately only the resulting pdf extracted file has lost the OCR informaton, seems to be a simple picture, but it is important for me that the yellow marks are kept and the pages can be extracted (with pdfsplitmerge pages from my larger pdf cannot be extracted - java error messages - and annotations - e.g. yellow marks - are list)).

Question: to solve the problem with pdfsplitmerge3 quickly - is it possible to get a serious discount for purchasing "VeryPDF Document Converter" ?

Thanks.
Customer
----------------------------------------
Hello,

to find a solution (because there is no response meanwhile) I purchased your tool "Doc Converter" cmd line as an alternative ($ 75( for the not functioning pdfplitmerge3.

But there is also an error in doc2pdf: the page range option of doc2pdf.exe seems not to work as intended. Only when I pass ONE page as an paramter (e.g. -g 5) it works as expected. When I provide a page range or a list of pages as described in the help file e.g. "-g 1,3,15,30,64", then it does NOT work and the complete source document is converted and not only the specified pages.

Customer
----------------------------------------
Please by following steps to try again,

1. Please download VeryPDF HTML Converter Command Line from following URL,

https://www.verypdf.com/htmltools/htmltools.zip

Please run following command line to register the HTML Converter Command Line product,

htmltools.exe "-$" "XXXXXXXXXXXXXXXXXXXX"

if you haven't administrator privilege in current user account, please pass the license key and other parameters together,

htmltools.exe "-$" "XXXXXXXXXXXXXXXXXXXX" "http://www.google.com" "C:\out.pdf"

You can run following command line to merge your PDF files into one PDF file without any problem,

htmltools.exe "-$" "XXXXXXXXXXXXXXXXXXXX" -mergepdf "D:\downloads\zip\1941Heft03_04_Slowakei.pdf|D:\downloads\zip\deutsche-wirtschaftsgeschichte_ein-jahrtausend_north-ambrosius-jenks-metz-schro.pdf" D:\downloads\zip\out.pdf

VeryPDF
----------------------------------------
Hello,

thanks,

but - sorry - I do not understand ...

There are serveral problems, and I do not know if the obviously NEW (other) tool you've sent (= HTMLTOOLS") does solve the problems:

Problem #1: PDFSPLITMERGE fails splitting my larger PDF files (e.g. as I have documented to you the file mentioned below 'deutsche-wirtschaftsgeschichte_ein-jahrtausend_north-ambrosius-jenks-metz-schro.pdf') . As far as I can see HTMLTOOLS does not support splitting pdf files ...

Problem#2: Merging splitted PDF files (splitted with PDFSPLITMERGE) have lost annotations

Problem#3: Your other tool - DocConverter - which I meanwhile purchased to solve the problem with merging has another serious problem: the page range option of doc2pdf.exe seems not to work as intended. Only when I pass ONE page as an paramter (e.g. -g 5) it works as expected. When I provide a page range or a list of pages as described in the help file e.g. "-g 1,3,15,30,64", then it does NOT work and the complete source document is converted and not only the specified pages. Does HTMLTOOLS support page ranges ?? Does it support long lists of inpiut files specified in an txt file as it is supported by PDFSPLIMERGE ??

Does HTMLTOOLS solve these problem ???

I need a working version of PDFSPLIMERGE !

Please respond !

Thanks
Customer
----------------------------------------
Thanks for your message, HTMLTOOLS does support #1 and #2, but it doesn't support #3, it is not support the page range.

also, I have a simple solution for you, I suggest you may open the problem PDF file in Adobe Acrobat or Adobe Reader, print it to docPrint PDF Driver, to create a new PDF file, then you can use PDFSPLIMERGE to split and merge new PDF file again, we hoping this solution will useful to you, please give it a try.

VeryPDF
----------------------------------------
Hello,

>>HTMLTOOLS does support #1 and #2,

can you please send me example command line codes for each (and for merging: how to merge a long list oft input pdf files with HTMLTOOLS which exceed the length of a command line, e.g. how to use a text file as an input source with a list of pdf files to be merged ...)

#doesn't support #3

ok, but will you fix the problem in DocConverter (the help file of DocConverter suggests that it WILL work this way, but it doesn't) ?

>also, I have a simple solution for you, I suggest
>you may open the problem PDF file in Adobe Acrobat or Adobe Reader,
>print it to docPrint PDF Driver

Sorry, this is not a solution, only an external manual workaround. And I have no idea which of my thousands of files are affected. I'm using your tools to automate my tasks. Sure, I can do all things manually mith Adobe Acrobat Pro - which works (in principle), but I have no time to do all these things manually ... I purchased your products to automate the tasks ...

Customer
----------------------------------------
>>Problem #1: PDFSPLITMERGE fails splitting my larger PDF files (e.g. as I have documented to you the file mentioned below 'deutsche-wirtschaftsgeschichte_ein-jahrtausend_north-ambrosius-jenks-metz-schro.pdf') . As far as I can see HTMLTOOLS does not support splitting pdf files ...

You can use -burstpdf parameter to Burst PDF file into single page PDF files, for example,

htmltools.exe -burstpdf D:\temp\Edit4.pdf D:\temp\Edit4-out.pdf

>>Problem#2: Merging splitted PDF files (splitted with PDFSPLITMERGE) have lost annotations

-mergepdf parameter does keep the annotations automatically, for example,

htmltools.exe -mergepdf "C:\1.pdf|C:\2.pdf|C:\3.pdf" C:\out.pdf
htmltools.exe -mergepdf "C:\*.pdf" C:\out.pdf
htmltools.exe -mergepdf "C:\filelist.txt" C:\out.pdf
htmltools.exe -mergepdf "C:\test-extraction.pdf*10-15 C:\out10-15.pdf
htmltools.exe -mergepdf "C:\test-extraction.pdf*500-501 C:\out500-501.pdf

VeryPDF
----------------------------------------
Hello, thanks for the hints ...

There is one more question regarding HTMLTOOLS:

when merging extracted pages to a single PDF file, the output file has the same page format as the single pages. This is what is intended ...

But when extracting only some pages from a source pdf file directly into a merged pdf file (e.g. htmltools.exe -mergepdf "C:\test-extraction.pdf*10-15 C:\out10-15.pdf") then the page format changes and the source pages appear very small in the merged target file (see screenshot) - i.e. the main part of the pages in the new merged pdf file is empty. How to prevent this ?

Thanks.

Customer
----------------------------------------
Thanks for your message, htmltools.exe does merge your PDF files properly, you may run following command line to try,

htmltools.exe -mergepdf "D:\downloads\zip(1)\1941Heft03_04_Slowakei.pdf|D:\downloads\zip(1)\deutsche-wirtschaftsgeschichte_ein-jahrtausend_north-ambrosius-jenks-metz-schro.pdf" D:\downloads\zip(1)\out.pdf

VeryPDF
----------------------------------------
Hello,

thanks for responding.

>Thanks for your message, htmltools.exe does merge your PDF files properly,
>you may run following command line to try,

This id an earlier problem with another tool which lost annotation info during merging...

The current problem with HTMLTOOLS is (see above):

"... when extracting(!!) only some(!!) pages from a source pdf file directly into a merged pdf file (e.g.

htmltools.exe -mergepdf "C:\test-extraction.pdf*10-15 C:\out10-15.pdf" "c:\out.pdf"

), then the page format changes and the source pages appear very small in the merged target file (see screenshot) - i.e. the main part of the pages in the new merged pdf file is empty. How to prevent this ?"

Tot test it, please take - instead of the demo name ("test-extraction.pdf*10-15") - the larger file ("deutsche-wirtschaftsgeschichte_ein-jahrtausend_north-ambrosius-jenks-metz-schro.pdf") from my file that I have sent.

Thanks

Customer
----------------------------------------
Hello,

try THIS with the file I've sent you ...

htmlconverter.htmlconverter "-mergepdf", "Deutsche-Wirtschaftsgeschichte_Ein-Jahrtausend_North-Ambrosius-Jenks-Metz-Schroeter-Ziegler_1.PDF*1*2*3*36*124*142*159*160*191*192*193*256*286*287*307*341*356*357*523", "north_part_from_original2.pdf"

Then you see the effect shown above in the screenshot.

Thanks
Customer
----------------------------------------
Hello again,

when I pass the command line "htmlconverter.htmlconverter "-mergepdf", "Deutsche-Wirtschaftsgeschichte ..." WITHOUT a page range then it works and the resulting (target) file has the same page format as the source. But as soon as I specify pages to be extracted/merged ("*1*2*5*7 ...") then the target page format changes very for a unknown reason to a specific but unknown format ... as documented (see screenshot).

Customer
----------------------------------------
We have created a new version of htmltools.exe for you, you can run following command line to extract and merge your PDF pages to a new PDF file properly,

htmltools.exe "-$" "XXXXXXXXXXXXXXXXXXXX" –mergepdf
"%CD%\test.pdf*1*2*3*36*124*142*159*160*196*198*523" "%CD%\out.pdf"

htmltools.exe –mergepdf "D:\test.pdf*1*2*3*36*124*356*357*523" "D:\out.pdf"

VeryPDF
----------------------------------------
Hello,

thank you VERY much for your support! Now, it seems to work properly !

Best regards,
Customer
----------------------------------------

VN:F [1.9.20_1166]
Rating: 10.0/10 (2 votes cast)
VN:F [1.9.20_1166]
Rating: 0 (from 0 votes)
How to extract pages from several PDF files and merge them into a new PDF file?, 10.0 out of 10 based on 2 ratings

Related Posts

Leave a Reply

Your email address will not be published. Required fields are marked *


Verify Code   If you cannot see the CheckCode image,please refresh the page again!