• Global community
    • Language:
      • Deutsch
      • English
      • Español
      • Français
      • Português
  • 日本語コミュニティ
    Dedicated community for Japanese speakers
  • 한국 커뮤니티
    Dedicated community for Korean speakers
Exit
0

Recognize Text: Not working on large files.

New Here ,
Dec 01, 2017 Dec 01, 2017

Copy link to clipboard

Copied

I need to be able to send attorneys PDF files that contain thousands of pages of PDFs. (It needs to be in one PDF.) When I use the Recognize Text feature and run it through the file it appears to be converting since I see the status "Converting scanned page to Searchable Image", but when it's complete and I go to do a search (Control + F) it doesn't work.

Has anyone else experienced this and know what to do to fix this? Or a workaround. This is a functionality that we need to have.

TIA!

TOPICS
Scan documents and OCR

Views

788

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
Dec 02, 2017 Dec 02, 2017

Copy link to clipboard

Copied

I've noticed that long docs (400 + pp) take a lot of quiet time at the end to consolidate fonts etc. and your PC might go to sleep before finishing.  That and/or other interruptions can KO the whole process.  I'm not an expert but I'd try splitting super-long docs into 100 page sections, then optimizing or OCRing each section, and then joining them back together.  That way you lose less time if one section goes south and needs to be done over. 

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
New Here ,
Dec 04, 2017 Dec 04, 2017

Copy link to clipboard

Copied

LATEST

Thanks for the reply! In this particular instance where I found this to be a problem was where the PDF file had over 7,000 pages. The thought of breaking it into smaller chunks was our resolution, although tedious. I get that it's not exact science. Just wondering if there was any other workaround out there.

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines
LEGEND ,
Dec 03, 2017 Dec 03, 2017

Copy link to clipboard

Copied

It might also be that the original is not suitable for converting to text. For example, handwritten material won't convert. Poor scans, scans that aren't written as English paragraphs with correct spelling, weird fonts: all these have low success rates. It's all guesswork, not an exact science.

Votes

Translate

Translate

Report

Report
Community guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
community guidelines