Sorry i have submitted by mistake...
So do you know how to find all exponent and subscript characters in a document ?
I need to circle those caracteres by html tags (<e> or <inf>) in order to extract contents of documents in html.
It sounds like you're doing something very wrong.
How are you exporting your document before processing it for extraction?