1 Reply Latest reply on Aug 18, 2011 5:48 AM by MichaelKazlow

    Adobe Reader breaks text extraction possibility

    marterland

      Hi everyone!

       

      I am trying to find the cause of a very specific problem with three components involved.

       

      1. One specific type of pdf (created using the iText library)

      2. Adobe Reader

      3. Nova PDF Virtual printer.

       

      When I open the PDF in Adobe Reader and then print it with the Nova virtual printer, the resulting pdf looks fine. However, it is not possible to extract text from it (programatically). 

       

      If I repeat the procedure with Adobe Reader replaced with the PDF Reader "Nuance PDF Reader", it works as expected; I can extract text from the resulting PDF file.

       

      Can anyone shed some light on what is happening here? Why does Nuance succeed where PDF Reader fails? (I can mail sample files for both scenarios if needed.)

       

      Thanks in advance!

       

      /Martin