2 Replies Latest reply: Dec 2, 2014 11:18 AM by tracy_dc RSS

    Export to HTML - inconsistent table formatting

    tracy_dc Community Member

      I have over 100 PDFs that I batch-converted to HTML. The only difference between the PDF files was page counts (ranging from 1-50) and none of the PDFs are tagged. The PDFs contained long charts/tables with consistent layouts.

       

      The results were inconsistent in that half of the HTML files came out with a properly-formatted table while the other half came out with basically an HTML header/footer and a few random styles but converted the table to text only. This text only issue tended to occur on the short (1-2 page) PDFs.

       

      I am using Acrobat X Pro.

       

      Any input is appreciated.

       

      Tracy

        • 1. Re: Export to HTML - inconsistent table formatting
          CtDave CommunityMVP

          Keep in mind the "tables", "rows", "columns", "styling", etc. (all word processor features) are not present in/on a PDF page.

          As the "export" of non-tagged PDF page content is improved with Acrobat XI one can still expect to do "clean up".

          Basically, that's what you must do now to your exported content.

           

           

          Be well...

          • 2. Re: Export to HTML - inconsistent table formatting
            tracy_dc Community Member

            The weird thing is that none of the PDFs were tagged, but some of the exported HTML files included HTML tables while others didn't. I actually spent a long time yesterday manually tagging one of the problematic PDFs with a table structure but the results were no different than when I exported the untagged version.