I'm using getpagenthword and getpagenthwordquads to return words and positions using vb.net and an object jsObject. I pass the words to a string variable and the positions to an array. So everything works fine until getpagenthword comes to symbol character like # or - on it's own. My string doesn't have anything in it but my getpagenthwordquads gives the co-ordinates. I'm using acrobat X SDK. Anyone know what might be going on?
Have you tried specifying getPageNthWord's bStrip parameter as false?
Excellent that looks to have sorted it.