I've got a situation where I need to take numbers from a PDF file which is a previsouly scanned-in copy of an invoice. The PDF will be opened alongside a data entry form for the user to complete the form with information from the invoice. However, what I need to do is offer the user suggestions for some of the form fields (such as Net Total, VAT (Tax), Grand Total etc). I know that for specific suggestions for each field I would need some kind of 'zone' OCR, so if I could only pull out all numbers in the scanned image of the invoice from the PDF, I could offer all numbers as a suggestion in a drop-down.
I am using CFMX7, so I'm looking for a way to do this or some kind of component which will allow me to do this.
All the best
I am not sure it if will meet your needs, but you might look into jPedal. IIRC, it has some text extraction capabilities.