JFriddle12

Converting PDF files to Excel Documents Inquiry

Aug 6, 2012 7:50 PM

Tags: #to #excel #converting

I have financial statements, such as the Statement of Assets and Liabilities, that I have had difficulty converting from PDF to Excel using Acrobat X Pro.

 

The Statement of Assets and Liabilities has one column with descriptions of the assets and liabilities and a second column with numerical values. My problem is that when converting from PDF to Excel, the description sometimes ends up in the same cell as its numerical value and other times the two land in separate cells.

 

What I am trying to achieve is an Excel file that consistently has the numerical values in their own column. I have been told that the PDF I am trying to convert may be too complex for the optical character recognition (OCR) technology used in Acrobat X Pro. Does anyone have any suggestions?

 

Thank you

 
Replies
  • Aug 7, 2012 12:49 AM   in reply to JFriddle12

    In a practical sense there's not a lot you can do. When Acrobat exports a scanned file it has no idea what the structure is, so it has to guess based on the X/Y position of each text object (it can't see the borders or fills, as those aren't OCRed). If your PDF had come from an authoring application and had structure tags it would export perfectly, but putting tags onto a table in an OCRed file is a nightmare of a job. It would be quicker to type it out again in Excel.
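
If retyping is not appealing, one possible workaround is to accept the inconsistent export and tidy it up afterwards. The sketch below is not an Acrobat feature. It is a small Python heuristic, and it assumes each exported row has been read into a list of cell strings (for example from a CSV saved out of Excel). The names `split_cell` and `normalize_row` are made up for illustration. It treats a trailing number as the amount and a value in parentheses as a negative.

```python
import re
from decimal import Decimal

# A trailing amount such as 1,234,567 or $12.50 or (12,345.67).
# Heuristic only: any number at the end of the text is treated as the amount.
TRAILING_AMOUNT = re.compile(
    r"^(?P<label>.*?)\s*(?P<amount>\(?-?\$?\d[\d,]*(?:\.\d+)?\)?)$"
)


def split_cell(text):
    """Split 'description 1,234' into (description, Decimal) or (text, None)."""
    text = text.strip()
    match = TRAILING_AMOUNT.match(text)
    if match is None:
        return text, None
    raw = match.group("amount")
    # Accounting style: a value wrapped in parentheses is negative.
    negative = raw.startswith("(") and raw.endswith(")")
    digits = raw.strip("()").replace("$", "").replace(",", "")
    value = Decimal(digits)
    return match.group("label"), (-value if negative else value)


def normalize_row(cells):
    """Handle both export outcomes: value in its own cell or merged with the text."""
    joined = " ".join(c.strip() for c in cells if c and c.strip())
    return split_cell(joined)


if __name__ == "__main__":
    rows = [
        ["Cash and cash equivalents 1,234,567", ""],  # merged into one cell
        ["Accrued expenses", "(12,345.67)"],          # already separate
        ["LIABILITIES", ""],                          # heading, no amount
    ]
    for row in rows:
        print(normalize_row(row))
```

A row whose description itself ends in a number (a date line such as "As of June 30, 2012", say) would be split incorrectly, so the result still needs a quick visual check against the PDF.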

     
  • Aug 7, 2012 6:00 AM   in reply to Dave Merchant

    You could try copying the data across one column at a time. That might do the job better than what you are doing currently.

     
