Export PDF to Excel comes up empty

Report · Apr 25, 2017

Hello,

Running a PC on Windows 7 Enterprise, 32-bit. On one computer, I have Adobe Acrobat Pro 7, and on another, I have Adobe DC, and have tried using both programs to export a PDF to an Excel, without success.

A team member was able to do this a few weeks ago on Adobe DC (have never been successful with Pro 7, and it is no longer working in Adobe DC) with the particular PDF documents that we are trying to export, but we are no longer able to make this happen. In Adobe DC, we follow the standard rules of exporting PDFs (choose Export from Tool panel, choose spreadsheet, etc). It does "export" the PDF, but when the exported document opens in Excel, the Excel is completely empty. I tried exporting this same PDF to Word and it was mostly successful.

Has anyone experienced an empty export when trying to convert PDF to excel? I'm also happy to send the PDF to someone else so you can see if you experience the same problem...

Thanks for any assistance.

Jessica

Report · Apr 25, 2017

Does your PDF contain actual form fields?

Report · Apr 26, 2017

It does not—it is a PDF of a web page with some data on it that we want to transform into Excel for analysis (cutting/pasting puts the columns out of order). What do form fields do when trying to convert PDF to Excel?

Report · Apr 26, 2017

Form fields are not being converted, so if your complete document is made up form form fields, you would not get any information in your resulting Excel file.

I assume that the file you are trying to convert does not contain real text - very likely because somebody converted all text to outlines in order to make it harder to extract data. Can you select text and copy and paste it into e.g. a Word document?

Converting from PDF to Word, Excel or any other format is one of the most complex things you can try to do with a PDF file. It works very well in some cases, in other cases the output has very little to do with the original file. The key for success is that the PDF file needs to be "tagged" - which means that it contains information about the information that is displayed in the file. The best way to make sure that a PDF file is tagged correctly is by using the PDFMaker in Acrobat to create the PDF file from Word or Excel (that's the Acrobat ribbon or toolbar).

Sometimes it helps to save the PDF file as a set of high resolution (e.g. 600dpi) images, then import these images back into Acrobat, run OCR and then export to Word or Excel again.

There are other tools available that can convert PDF to Excel. Whenever I come across a file that does not want to behave (and I don't want to go through the process for converting to an image and importing again), I give Tabula (http://tabula.technology) a chance. However, because you are not getting anything, I suspect that this will not work either.

Report · Apr 26, 2017

Hi Karl, I appreciate your detailed response. I think I’m understanding what you mean—we hired this company to host data for us and they are only willing to do one data dump per month, which really doesn’t work for my team since we are in an iterative pilot phase of the project and need to be modifying our procedures based on the data outcomes.

When I exported the PDF as a Word document, it came out a bit garbled but was mostly legible—but if I’m understanding you correctly, I think what you’re saying about converting actual text to outlines is correct, because all of the words and numbers are little image type boxes (rather than being able to move a cursor through them freely).

I just tried copying the things we need into a word document and I’m able to manipulate it a bit there, but ultimately need to pass it on in an excel version. This is a possible workaround, but unfortunately would be very time consuming since we have to get data from ~40 different sites, and then manually manipulate them all.

Since we are creating this PDF from a website that is hosting our data, it must not be ‘tagged’ as you have explained.

I also found out from a team member that he has previously been able to do this with Adobe Acrobat 10, but my other colleague and I are having difficulty with Acrobat DC and Acrobat 9.

We’ll have to find a workaround. Thank you.

Jessica

Report · May 03, 2017

If you get date in Word, chances are that the document is not converted to outlines. You may want to give a 3rd party tool a try: Tabula: Extract Tables from PDFs Sometimes I get better result with Tabula, but it's very slow.

You can also try to export the PDF files as a series of high resolution TIFF images (e.g. 600dpi) and then import these images back into Acrobat as a PDF file. Then run OCR and try to export again.

Report · Aug 31, 2018

I am experiencing the same blank page popping up when I try to export PDF to WORD and EXCEL. I have been able to this in the past, but this week I have not been able to use the tools. The reason I purchased ADOBE was to be able to do the things I need to do. So, I am not interesting all the complicated workarounds. I want product to work as it used to work. Advice appreciated.

Report · Aug 31, 2018

Problem solved! When I right clicked on desktop pdf copy of data, I noticed the default was Adobe Acrobat 15......When I chose Adobe Acrobat 18.....the conversion happened easily and I got to the familiar page. How this default was decided I have no clue. But I am happy happy to be back in business.

Adobe Community

Export PDF to Excel comes up empty

1 Correct answer