I need help with scanned documents!
How can highlight parts and recognise (OCR) only the those I selected?
I have about 500 books scanned and I would need only the important parts.
I don't see any meaning to OCR all of the pages, since I need only 1% of the material.
Could I higliht some parts some kind of way (with a tablet for example) and afterwards make the OCR on that selected parts only and error correction in another more powerfull machine?
Or just keep those small parts in a separate file without OCR and editing?
Well, you would need Adobe Acrobat at a minnimum if you want to deal with OCR since Reader doesn't have that ability.
As far as OCRing just the highlighted areas, I doubt it. There is no straight forward way to do that but I guess there could possibly be a script out in the wild but I haven't seen one. For starters you would need to figure out how to highlight text that isn't text.
If it's just ceratin pages you need OCR done on, you could extract those pages as separate files and OCR them.
The problem is:
I have to hihlight the reqiured text othervise I will never found it.
Than I have to make a new PDF that contains only the required part.
But it would be way better to have the possibility to highlit and OCR it "real time" then export it and than I can place it to the specific folder/file so it will be not only organized, but searchable as well.
Something like that could be added as a feature I guess. It saves a big bunch of time for the user (specially on an old PC) and also saves recources (power ect)
I have many books that is 4-500pages and I would need about 10 of them, but into various files.
Any help wellcome
You can put in a feature request at https://www.adobe.com/cfusion/mmform/index.cfm?name=wishform
Be sure to mention Acrobat and not Reader (this is the Reader forum).
Not only is it not possible to OCR just parts of a page (you can specify a specific page range, though), I doubt it will ever be implemented. It just doesn't make sense. Why would you want only some of the text on a page to be "selectable" and not the rest? I don't really see how it saves any time or resources. It's much more efficient to just OCR the entire file (or page), then to launch the OCR process each time a highlight is made...