Hi,
	
	I have a PDF with XML data (JDF) embedded in the file.
	I can see it if I open the PDF as a text file.
	It isn't visible in the metadata when I search for it.
	
	I have tried to save the PDF in Acrobat and export it to XML, but that isn't working (I get error messages).
	
	Which application can I use to get the data out?
	
			
			
									
						
										
						Getting the XML out of the PDF
- 
				dkelly
 - TOP CONTRIBUTOR
 - Posts: 658
 - Joined: Mon Nov 29, 2010 8:45 pm
 - Location: Alpharetta GA USA
 
Apago's PDFspy
			
			
									
						
										
						- 
				Peter Kleinheider
 - Newbie
 - Posts: 17
 - Joined: Mon Dec 13, 2010 4:52 pm
 
Flow666 wrote: Hi,
	
I have a PDF with XML data (JDF) embedded in the file.
I can see it if I open the PDF as a text file.
It isn't visible in the metadata when I search for it.
	
I have tried to save the PDF in Acrobat and export it to XML, but that isn't working (I get error messages).
	
Which application can I use to get the data out?
	
	
	
Can you please provide the PDF? There are various ways to extract the JDF from the PDF and get access to the XML data.
	
peter[at]inpetto[dot]cc
	
Thx,
Peter Kleinheider
			
			
									
						
										
- 
				Clive Andrews
 - Member
 - Posts: 85
 - Joined: Thu Jun 23, 2011 11:41 am
 
Yeah - if you can put a link to it, I'll have a look too...
			
			
									
						
										
						- 
				Peter Kleinheider
 - Newbie
 - Posts: 17
 - Joined: Mon Dec 13, 2010 4:52 pm
 
Good afternoon,
	
The XML code you refer to is part of a PostScript Form XObject. I do not know of any software that extracts such PostScript parts as part of its functionality.
	
The only solution I know is to write a Switch script that searches for such XML inside PS Form XObjects and either saves it to a separate file or attaches it as a dataset.
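
As a rough illustration of the search-and-save idea outside Switch, here is a minimal Python sketch. It is not a Switch script and not a PDF parser: it assumes the JDF is stored uncompressed (so it is visible when the PDF is opened as text) and simply scans the raw bytes for `<JDF>` elements. The function names `extract_jdf_blocks` and `save_jdf_blocks` and the output file naming are made up for this example.

```python
import re
from pathlib import Path

# Matches an opening, self-closing, or closing JDF tag in the raw bytes.
# Assumption: the XML is uncompressed and tags contain no literal ">" in attributes.
JDF_TAG = re.compile(rb"<JDF\b[^>]*>|</JDF\s*>")


def extract_jdf_blocks(pdf_bytes: bytes) -> list[bytes]:
    """Return each outermost <JDF>...</JDF> span found in the raw PDF bytes."""
    blocks = []
    depth = 0      # nesting level, since JDF nodes can contain JDF nodes
    start = None   # byte offset of the current outermost opening tag
    for match in JDF_TAG.finditer(pdf_bytes):
        tag = match.group(0)
        if tag.startswith(b"</"):
            if depth == 0:
                continue  # stray closing tag, ignore it
            depth -= 1
            if depth == 0:
                blocks.append(pdf_bytes[start:match.end()])
                start = None
        elif tag.endswith(b"/>"):
            if depth == 0:
                blocks.append(tag)  # self-closing top-level element
        else:
            if depth == 0:
                start = match.start()
            depth += 1
    return blocks


def save_jdf_blocks(pdf_path: str, out_dir: str) -> list[Path]:
    """Write each JDF span found in pdf_path to out_dir as a numbered .jdf file."""
    source = Path(pdf_path)
    target = Path(out_dir)
    target.mkdir(parents=True, exist_ok=True)
    written = []
    for index, block in enumerate(extract_jdf_blocks(source.read_bytes()), start=1):
        out_file = target / f"{source.stem}_{index}.jdf"
        out_file.write_bytes(block)
        written.append(out_file)
    return written
```

Calling `save_jdf_blocks("job.pdf", "out")` (both names are placeholders) would write one file per JDF element found. If the XML sits inside a compressed stream, this scan finds nothing and the stream would have to be decoded first.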
	
If that is something you are interested in, just drop me a line or get in touch with other folks here on the list.
	
Cheers,
Peter
			
			
									
						
										