We are using the Adobe Reader 9 iFilter to parse PDF documents into a full-text index. Now we have some documents that are causing trouble.
These documents can be opened and read perfectly in Adobe Reader 9, but not through the iFilter.
Another strange behaviour of these documents appears when copying text from the PDF into an editor such as Notepad or WordPad. Instead of the text we just get strange characters, for example " ( ( , (4" instead of "This is to certify". There are embedded fonts in the document, for example "TTC01o==", which I have never seen before.
Can anyone help me solve this problem?
Looks like fonts with custom encoding.
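If that is the cause, a toy model may help show why the page displays correctly while copy/paste and the iFilter return garbage. This is only a sketch in Python: the code values, the mapping, and the function names `render` and `extract` are invented for illustration and are not taken from the document in question or from any PDF library.

```python
# Toy model of a font with a custom encoding.
# All codes and names below are hypothetical, chosen only for illustration.
custom_encoding = {0x28: "T", 0x2C: "h", 0x34: "i", 0x29: "s"}

def render(codes):
    """What the viewer draws: the embedded font turns each code into a glyph."""
    return "".join(custom_encoding[c] for c in codes)

def extract(codes, to_unicode=None):
    """What copy/paste or a text filter sees.

    With a code-to-Unicode map the real text can be recovered. Without one,
    the raw codes are read as ordinary characters (Latin-1 is assumed here).
    """
    if to_unicode is None:
        return bytes(codes).decode("latin-1")
    return "".join(to_unicode[c] for c in codes)

codes = [0x28, 0x2C, 0x34, 0x29]
print(render(codes))                    # glyphs shown on screen
print(extract(codes))                   # text seen without a Unicode map
print(extract(codes, custom_encoding))  # text seen with a Unicode map
```

In this model the page looks right because the font knows which glyph belongs to each code, but any tool that reads the text needs a separate map from codes to Unicode. If the PDF does not carry such a map, every extractor is likely to return the same kind of garbage.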
Thanks for the fast reply. Is there any way to handle these PDFs with custom-encoded fonts using the Adobe iFilter?
We are also considering buying a third-party PDF iFilter to solve the problem. We tried Foxit and TET, but they had the same trouble.
Maybe you can give me a hint on how to solve this problem.
You may look for a program which can modify the font encoding.