I have price sheets I scan into a file for OCR.
product "a" $15.95
product "b" $21.00
etc.
product "s" $32.95
The resulting document defaults to selecting text by column instead of interpreting the data as rows (left to right). Starting at, say, [product "b"], the selection runs down to [product "s"] at the bottom, then jumps to the top of the price column [$15.95, $21.00...] and runs to the bottom of the page.
Is there a way to force the OCR engine to recognize the text as rows of data instead of columns of data?
I tried to fake out Acrobat Pro XI by seeing if I could save the PDF to HTML and extract my tables. That did not work.
Saving to MS Excel doesn't work directly either. I had held out hope that I could copy the tabular matter from Excel into Word and preserve the cells' contents. No: when you paste from Excel to Word, you still get stuck with columns.
Without this capability, I don't see much reason to upgrade from version 9 for what I need to do.