6 Replies Latest reply: Jan 13, 2012 10:14 AM by jt553 RSS

    Export PDF to XML

    DAJINKYA

      Hi,

       

      I am trying to export pdf to xml using Adobe Acrobat Professional.

      I can export the data pretty nicely, but it is not exporting the headers/Footers from the PDF.

       

      Is there a way to extract headers/footers of the pdf document?

       

      Thanks

      AJ.

        • 1. Re: Export PDF to XML
          lrosenth Adobe Employee

          PDF documents don't have "headers" or "footers" - it's all simply page content.

           

          Also, you don't mention what method(s) you are using to export the XML and with what version of Acrobat and SDK.

          • 2. Re: Export PDF to XML
            DAJINKYA Community Member

            Irosenth,

             

            Apology for incomplete information, I am using Adobe Acrobat 9.0 Pro.

            And the way I am exporting it to xml is "File->Export->XML" or "File->SaveAs->xml"

             

            Well, our pdfs are converted using some free java library, it a word document which has header & footer, and then it is converted into pdf using that java library.

             

            So when I export that pdf to xml from adobe acrobat pro, I don't see header and footer value in the xml, rest all looks fine.

            • 3. Re: Export PDF to XML
              lrosenth Adobe Employee

              That area may be identified as an artifact, so it isn't getting exported.  Without seeing a file, it's difficult to say.

              • 4. Re: Export PDF to XML
                DAJINKYA Community Member

                Actually those are the Termsheet PDF,and they are confidential so I cannot share.

                 

                So there is no way to get data from artifact(if at all it is identifying as artifact)?

                 

                My purpose is to extract that data from pdf and validate against some expected data.

                 

                I tried couple of tools online which coverts pdf to word. but I dont find it worth comparing those converted word doc

                And xml seems reliable.

                • 5. Re: Export PDF to XML
                  DAJINKYA Community Member

                  how do I attach a document here?

                   

                  I have created a dummy pdf which has the header and footer as i said in my previous conversation.

                  also the xml exported out of it.

                  • 6. Re: Export PDF to XML
                    jt553

                    DAJINKYA wrote:

                     

                    Hi,

                     

                    I am trying to export pdf to xml using Adobe Acrobat Professional.

                    I can export the data pretty nicely, but it is not exporting the headers/Footers from the PDF.

                     

                    Is there a way to extract headers/footers of the pdf document?

                     

                    Thanks

                    AJ.

                     

                    Another way you can do that is to use EDI Link Connect. It can export data from a PDF (headers & footers included). The XML will be structured properly so you can immediately import into the program of your choice. Its intended for Business documents like Orders,Invoices,Shipping,Reports etc. I'm not sure if thats the type of PDFs you're looking for but if it is, that might be something to look at. Here is a link with more info:

                     

                    Converting PDFs to XML with EDI Link:http://ecdynamics.com/pdf-conversion.php

                    as well as another article:

                    http://softertech.wordpress.com/2011/12/12/importing-pdfs-into-quickbooks-or-simply-accoun ting/