
Other options to convert scanned-document PDF into text?

Sep 19, 2011 8:37 AM

I often get PDF files where the document was scanned in as an image, and I'm wasting time re-typing them. I found a way online to have Google do the conversion, but it is slow (link below if you're curious). What are some other options for converting this type of PDF into real text?

 

http://www.labnol.org/software/convert-scanned-pdf-images-to-text-with-google-ocr/5158/

 
Replies
  • Sep 19, 2011 8:42 AM   in reply to HealthcareHelper

    You can perform OCR in Adobe Acrobat.

     
  • Sep 19, 2011 12:53 PM   in reply to HealthcareHelper

    Acrobat performs OCR on the document. You can then select the text and copy it to the clipboard.

     
  • Sep 19, 2011 9:20 PM   in reply to HealthcareHelper

    The recognized text is stored in the PDF itself, not somewhere else. After OCR you can save the file as text (or DOC) to get the text out. You are apparently using the Searchable Image option, which keeps the scanned image and puts the searchable text on a separate layer in the PDF file. You could also use ClearScan, which replaces the image of the text with actual text wherever it thinks recognition succeeded.
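
Since the OCR text lives inside the PDF, it can also be pulled out in bulk rather than one file at a time. The following is only a minimal sketch, and it rests on an assumption that is not part of this thread: that the `pdftotext` command-line tool from Poppler is installed and on the PATH. It does not perform OCR; it only writes out whatever text layer the PDF already contains, so a scan that has not been through OCR yields an empty or near-empty file. The function names and the file name in the usage note are hypothetical.

```python
import shutil
import subprocess
from pathlib import Path


def build_pdftotext_command(pdf_path, txt_path, keep_layout=False):
    """Build the argument list for Poppler's pdftotext CLI (assumed installed)."""
    command = ["pdftotext"]
    if keep_layout:
        # -layout asks pdftotext to keep the physical layout of the page.
        command.append("-layout")
    command += [str(pdf_path), str(txt_path)]
    return command


def extract_text_layer(pdf_path, txt_path=None, keep_layout=False):
    """Write the PDF's existing text layer to a text file and return its path.

    No OCR happens here: the PDF must already contain recognized text.
    """
    pdf_path = Path(pdf_path)
    # Default to a .txt file next to the PDF when no output path is given.
    txt_path = Path(txt_path) if txt_path else pdf_path.with_suffix(".txt")
    if shutil.which("pdftotext") is None:
        raise RuntimeError("pdftotext (Poppler) was not found on PATH")
    command = build_pdftotext_command(pdf_path, txt_path, keep_layout)
    # check=True raises CalledProcessError if pdftotext exits with an error.
    subprocess.run(command, check=True)
    return txt_path
```

Calling `extract_text_layer("scanned_ocr.pdf")` would, under these assumptions, write `scanned_ocr.txt` next to the PDF. Wrapping the call in a loop over a folder is one way to avoid saving each file by hand.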

     
