Extract Text from PDF for Metadata/Private Data Use

GSBSwitch1
Newbie
Posts: 13
Joined: Thu Jan 07, 2016 8:42 pm

Post by GSBSwitch1 » Tue Jan 05, 2021 10:24 pm

Is there a way to search a PDF for a string of text, extract it, and use it within Switch as metadata or private data (or in any other form) so that I can rename the PDF? I want each incoming PDF to be named after that string of text inside it.

I can search for the text segment in PitStop, but I couldn't figure out a way to log the selected text as a variable. It seems simple enough, but I am stuck on finding a solution.

--Evan

abailescollins
Advanced member
Posts: 380
Joined: Wed Apr 22, 2015 4:28 pm

Re: Extract Text from PDF for Metadata/Private Data Use

Post by abailescollins » Wed Jan 06, 2021 10:50 am

I think we can do this by getting the text you need written into the XML report, so that you can access it there and then use it as a variable to rename the file.
Maybe drop me a mail with some examples of what you want to do, and we can experiment.

I seem to remember doing this before with a customer.
PitStop Product Manager @ Enfocus.
andrewb@enfocus.com

GSBSwitch1
Newbie
Posts: 13
Joined: Thu Jan 07, 2016 8:42 pm

Re: Extract Text from PDF for Metadata/Private Data Use

Post by GSBSwitch1 » Wed Jan 06, 2021 4:08 pm

Andrew -- I emailed you directly with an example file to test.

GSBSwitch1
Newbie
Posts: 13
Joined: Thu Jan 07, 2016 8:42 pm

Re: Extract Text from PDF for Metadata/Private Data Use

Post by GSBSwitch1 » Fri Jan 08, 2021 3:52 pm

Curious whether there are any other ideas for how to accomplish this. I am starting with a CSV file that triggers a SmartStream Designer VDP config, which creates a multi-page, multi-record PDF. I then split that PDF into individual PDFs, one per record (every 8 pages of the original PDF). I then need to name each personalized PDF with the string "EMPLID_Lastname_FinAid2021.PDF". Everything is static except the last name, which comes from the original CSV.
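
Since the split PDFs come out in the same order as the CSV records, one option is to derive each name from the record's position instead of reading it back out of the PDF. Below is a minimal Python sketch of that idea, not a Switch script. It assumes the CSV has a header row, that the columns are called "EMPLID" and "LastName" (hypothetical names), and that EMPLID is a per-record value; the helper plan_output_names is made up for illustration.

```python
import csv
import io

PAGES_PER_RECORD = 8  # from the post: one record every 8 pages


def plan_output_names(csv_text, emplid_col="EMPLID", lastname_col="LastName"):
    """Return (first_page, last_page, filename) for each CSV record, in order.

    Column names are assumptions; adjust them to match the real CSV header.
    """
    plan = []
    reader = csv.DictReader(io.StringIO(csv_text))
    for index, row in enumerate(reader):
        # Record 0 covers pages 1-8, record 1 covers pages 9-16, and so on.
        first_page = index * PAGES_PER_RECORD + 1
        last_page = first_page + PAGES_PER_RECORD - 1
        # Keep only letters, digits, and hyphens so the name is file-system safe.
        last_name = "".join(
            ch for ch in row[lastname_col].strip() if ch.isalnum() or ch == "-"
        )
        emplid = row[emplid_col].strip()
        plan.append((first_page, last_page, f"{emplid}_{last_name}_FinAid2021.PDF"))
    return plan
```

The resulting list pairs every 8-page range with its target file name, so whichever tool does the split can look up the name by record number.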

The one idea I was trying is to write that string into the live area of the InDesign SmartStream Designer template, so that I could search for it and then use it within Switch to rename the PDF.
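
If the string does end up in the live area, the search step could look roughly like the Python sketch below. It assumes the page text has already been extracted to a plain string by some other tool (the extraction itself is not shown), and that the marker has the shape "<id>_<last name>_FinAid2021". The pattern and the helper name_from_page_text are illustrative assumptions, not part of Switch or PitStop.

```python
import re

# Assumed marker shape: <id>_<last name>_FinAid2021
# The id is letters/digits; the last name is letters and hyphens.
MARKER = re.compile(r"\b([A-Za-z0-9]+)_([A-Za-z][A-Za-z\-]*)_FinAid2021\b")


def name_from_page_text(page_text):
    """Return the target file name found in the text, or None if no marker is present."""
    match = MARKER.search(page_text)
    if match is None:
        return None
    # group(0) is the whole marker, e.g. "1001_Smith_FinAid2021".
    return f"{match.group(0)}.PDF"
```

Returning None when nothing matches makes it easy to route such files to a problem folder instead of renaming them incorrectly.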

Appreciate any ideas.

--Evan
