Split very large PDF

Post Reply
User avatar
Terkelsen
Advanced member
Posts: 229
Joined: Thu Sep 08, 2011 5:08 pm
Contact:

Split very large PDF

Post by Terkelsen » Wed May 22, 2019 10:30 am

One of my customers is receiving on a regular basis a PDF containing 100.000 pages :!: for printing. Yes, I know it is insane but that's how their customer insist to deliver.

I'm not surprised that their RIP is giving up on this job trying to feed the printer with 800 pages pr. minute. They have found that splitting the PDF into smaller portions will help and we try to automate this, but it takes forever. Split PDF feature in Switch gave up even though the time out was set to 1000 minutes.

Next thing is that we would want to rasterise the PDF. This action takes 17 minutes for a sample file of 100 pages, which means that it will take 14 days to manage 100.000 pages.

Does any of you have experience with a method that can handle this amount of pages in an acceptable time frame?

We all know that the correct way to handle this would be to ask the customer to deliver a two page PDF and a file containing all the variable data, but my customer got this order exactly because they accepted to receive the file like this. :roll:

jan_suhr
Advanced member
Posts: 287
Joined: Fri Nov 04, 2011 1:12 pm
Location: Nyköping, Sweden

Re: Split very large PDF

Post by jan_suhr » Wed May 22, 2019 10:42 am

Have you tried PDF Toolkit?

https://www.pdflabs.com/docs/pdftk-man-page/

Look at the cat command
Jan Suhr
Color Consult AB
Sweden
=============
Check out my apps

User avatar
Terkelsen
Advanced member
Posts: 229
Joined: Thu Sep 08, 2011 5:08 pm
Contact:

Re: Split very large PDF

Post by Terkelsen » Wed May 22, 2019 11:19 am

Hi Jan,

No, but I've tried ImageMagick and after two hours it's still working to just split a 100 page PDF.

Any idea what the argument should look like in Execute Command if I use PDFtk?

Zoranj
Member
Posts: 90
Joined: Tue Sep 20, 2016 7:37 pm
Location: Canada

Re: Split very large PDF

Post by Zoranj » Wed May 22, 2019 2:50 pm

I am afraid Pdftoolkit will give you similar performance like split pdf configurator.
At least that was my experience when I was in same situation.
I tried emailing author of Pdftollkit to ask if there is an option to recompile it to use more memory, never got response from him.

jan_suhr
Advanced member
Posts: 287
Joined: Fri Nov 04, 2011 1:12 pm
Location: Nyköping, Sweden

Re: Split very large PDF

Post by jan_suhr » Wed May 22, 2019 3:19 pm

Terkelsen wrote:
Wed May 22, 2019 11:19 am
Hi Jan,

No, but I've tried ImageMagick and after two hours it's still working to just split a 100 page PDF.

Any idea what the argument should look like in Execute Command if I use PDFtk?
You will use the something like this:

pdftk A=incoming.pdf cat A1-A5000 output outgoing.pdf

This command will make a new PDF from page 1 to page 5000.

With my app Execute command friend you could set up this with variables and a flow that loops and calculate new values for A1 and A5000.

With scripting you could do it even simpler if you have the Scripting module.
I think that the hard part here is the writing of the new file.

Is this some kind of variable job and they give you the total file for it?
As you know there are a lot of smart solutions out there to do it.
Jan Suhr
Color Consult AB
Sweden
=============
Check out my apps

User avatar
Terkelsen
Advanced member
Posts: 229
Joined: Thu Sep 08, 2011 5:08 pm
Contact:

Re: Split very large PDF

Post by Terkelsen » Wed May 22, 2019 4:13 pm

Hi Jan,

Yes, the stupid thing is that this is actually just a two page PDF with a variable number, a name and an address on page 1, but the customer insist on delivering this as a 100.000 pages PDF :roll:

Excuse my ignorance but what would the argument you suggest look like as an argument in Switch Execute Command?

jan_suhr
Advanced member
Posts: 287
Joined: Fri Nov 04, 2011 1:12 pm
Location: Nyköping, Sweden

Re: Split very large PDF

Post by jan_suhr » Wed May 22, 2019 4:52 pm

PDFToolkit is a little problematic to run directly in Execute command. If you are on Windows you need to create a .bat-file.

If you download the app I mentioned I think that there is a pdftk example for something like this.

For the customer, tell them that what send now will cost them €10000 extra, if they let you do the data handling it will not cost anything more. :D
Jan Suhr
Color Consult AB
Sweden
=============
Check out my apps

jan_suhr
Advanced member
Posts: 287
Joined: Fri Nov 04, 2011 1:12 pm
Location: Nyköping, Sweden

Re: Split very large PDF

Post by jan_suhr » Wed May 22, 2019 4:55 pm

Send me an email and I can tell you about a solution that might be acceptable for both of you.
Jan Suhr
Color Consult AB
Sweden
=============
Check out my apps

Post Reply