I have a problem that I can't find a solution to, concerning OCR scanning on older, scanned documents in Acrobat Pro X.
When I Clearscan a report on, let's say, 40-100 pages, no matter if it's in 300dpi or 600dpi, something very strange happens. Acrobat does a very poor interpretation of the first part of the document. When I copy and paste into a word processor, a sentence that originally goes "This is a report" comes out as "T h i s i s a r e p o r t". Between almost every letter/character there is a space. A few pages into the Clearscanned document, however, the result is better, and the text (which is the same as the former text) does not have spaces between characters.
Even more strange, when I scan one single page from the document - a page that from the first Clearscan (of the whole document) gave this non-searchable text - I get a good result! No spaces between characters here.
I'm going insane here, trying to figure out what's going wrong and what I can do to get around it. It seemingly has nothing to do with the scan resolution, neither with language settings or any other settings I have tried. The problem rather seems to be that Acrobat Clearscans differently depending on how many pages are being scanned, or where these pages are in the document/clearscanning process.
So, does anyone have an solution to this? I have MANY documents to scan and I don't have time to go into every one of them to manually correct the errors, so I would prefer a solution where this problem just doesn't occur to begin with. Am I doing something stupidly wrong to begin with, that creates this problem?
Very thankful for any help
I should add that I have tried OCR scanning the document with Searchable Image, and dis problem didn't occur, but I really need to do it with Clearscan to keep the file size down and have it look better.
I think I found the solution to your problem. Just save the problematic OCR Clearscan document as Text (Plain). By just doing so, Acrobat Pro corrects and eliminates all the spaces between characters. Then you only need to save again the PDF document, and you will see (even if hard to believe) that it is already fixed. At least it worked very well to me!