1 person found this helpful
I've noticed that long docs (400 + pp) take a lot of quiet time at the end to consolidate fonts etc. and your PC might go to sleep before finishing. That and/or other interruptions can KO the whole process. I'm not an expert but I'd try splitting super-long docs into 100 page sections, then optimizing or OCRing each section, and then joining them back together. That way you lose less time if one section goes south and needs to be done over.
2 people found this helpful
It might also be that the original is not suitable for converting to text. For example, handwritten material won't convert. Poor scans, scans that aren't written as English paragraphs with correct spelling, weird fonts: all these have low success rates. It's all guesswork, not an exact science.
Thanks for the reply! In this particular instance where I found this to be a problem was where the PDF file had over 7,000 pages. The thought of breaking it into smaller chunks was our resolution, although tedious. I get that it's not exact science. Just wondering if there was any other workaround out there.