No ideas here? Anyone want to even look at the PDF?
Why don't you go ahead and post a link to the PDF so a few folks can look at it.
I'm not sure what's going on. Searching for the word "pump" gives me 25 instances. Adding a space between the letters gives me 11 instances.
I notice that the fonts are not embedded in the file. That would be my first suspect. Try embedding them and see if the results are different.
Also, do you know what application was used to create the PDF (was it Acrobat or a third party application)? It doesn't say in file properties.
Fonts used are standard Adobe fonts (Courier). These are green-bar reports that are converted to PDF using homegrown software that I wrote.
I looked at the PDF itself in a text viewer (they are very simple PDF files) and there is no difference between the text lines where it finds PUMP and those where it finds P U M P, which is very odd. Third-party PDF viewers have no issues with it and find every instance of the word PUMP. (This problem isn't just with the word PUMP, but with others as well.)
I was hoping someone could find a blatant error in the PDF itself, or confirm that it could be a bug with Acrobat Reader itself.
> or confirm that it could be a bug with Acrobat Reader itself.
Please don't take this personally, but it is more than likely a bug in your homegrown software.
Adobe applications are a lot pickier about PDF standards than most third party viewers (not a bug but by design). If there's a problem with the way a PDF is created, Adobe apps will find it.
No problem. I won't take it personally. I'd like to know what the issue with the PDF is, though. The lines are just plain text, the same on every line, and it still doesn't find the text in every instance. PDFs don't get much simpler than this one. :)
What could I use to find possible bugs? Reader doesn't report any. And it still works in other 3rd party PDF readers just fine.
You may want to ask around in the developers forum at http://www.adobeforums.com/webx/.eed56a0/.
> I looked at the PDF itself in a text viewer (they are very simple
> PDF files) and there is no difference between the text lines
I tried copying text from PDF to Notepad and this shows very
definite differences between lines. The ones where searching works
are normal, the rest have single spaces between characters. There
are some very knowledgeable people over in adobe.pdf.specifications
who'll be able to tell you what's wrong.
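For anyone who wants to check a larger file the same way, here is a rough Python sketch that flags the affected lines in text that has already been copied or extracted from the PDF. The function names and the sample text are made up for illustration, and the pattern assumes the stray spaces are single spaces between single characters, as in the Notepad test above. It will also flag genuine runs of one-letter words, so treat the result as a hint only.

```python
import re

# A run of three or more single characters separated by single spaces,
# e.g. "P U M P" or "- d a t a - 1".
SPACED_RUN = re.compile(r"(?<!\S)(?:\S ){2,}\S(?!\S)")


def find_spaced_lines(text):
    """Return (line_number, line) pairs that contain a letter-spaced run."""
    return [
        (number, line)
        for number, line in enumerate(text.splitlines(), start=1)
        if SPACED_RUN.search(line)
    ]


def collapse_spaced_runs(text):
    """Remove the single spaces inside each letter-spaced run."""
    return SPACED_RUN.sub(lambda m: m.group(0).replace(" ", ""), text)


if __name__ == "__main__":
    # Hypothetical sample standing in for text pasted from the PDF.
    sample = "PUMP ROOM 12\nP U M P ROOM 14\n"
    print(find_spaced_lines(sample))
    print(collapse_spaced_runs(sample))
```

Ordinary words are left alone because the pattern only matches characters that stand by themselves between spaces.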
The issue seems to be with how you do bold and normal-weight fonts on the same line. Perhaps I'm doing it wrong?
I do one line with the normal text, and another with the bold (since they use different font IDs).
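One alternative that may be worth trying is to switch fonts inside a single text object rather than drawing the line twice. The PDF operators used here (BT, Td, Tf, Tj, ET) are standard, and Tj advances the text position, so the next run continues where the previous one ended. The Python below is only a sketch of that idea, not the poster's software: the function names are invented, and the font names F1 and F2 are assumptions that would have to match entries in the page's font resources. The thread does not confirm that this changes Acrobat's search results.

```python
def pdf_escape(text):
    """Escape backslashes and parentheses for a PDF literal string."""
    return text.replace("\\", "\\\\").replace("(", "\\(").replace(")", "\\)")


def mixed_weight_line(x, y, size, runs):
    """Build one text object that switches fonts mid-line.

    runs is a list of (font_resource_name, text) pairs. Each run is
    emitted exactly once, so no second padded line is overlaid on top.
    """
    ops = ["BT", f"{x} {y} Td"]
    for font, text in runs:
        ops.append(f"/{font} {size} Tf")       # select the font for this run
        ops.append(f"({pdf_escape(text)}) Tj")  # show text, advancing the position
    ops.append("ET")
    return "\n".join(ops)


if __name__ == "__main__":
    # F1 = Courier, F2 = Courier-Bold are assumed resource names.
    print(mixed_weight_line(72, 720, 10, [("F1", "PUMP "), ("F2", "STATION")]))
```

Because every character appears in the stream only once, there are no extra space characters stacked over the visible text for a viewer to pick up.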
I have a large document with multiple occurrences of this problem (extra spaces between characters). It makes it impossible to search. All affected words are using a heading style that includes a bold font. An example: additional-data-1 is rendered as additional - d a t a - 1
Why no spaces in additional? None of these spaces is visible to the naked eye so when you see the word it looks as if it is correct.
Without seeing the actual data in the PDF, it's hard to say.
My issue came from overlaying spaces over the text. So, let's say I had two lines with zero spacing between them:
You would then have to search for Text1 with spaces in it, which I feel is, well, a slight bug. Why?
1. All 3rd party PDF viewers work fine
2. When you search, wouldn't it search text "objects" (lines)?
3. If you overlay underlines, why doesn't it make you search for T_e_x_t_1?
Who knows. I now just suggest that users with this problem use a third-party viewer or avoid bold text in documents that are converted to PDF.
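If the text has to be searched outside the viewer anyway, another workaround is a search that tolerates an optional space between letters. This is a small Python sketch that works on text already extracted from the PDF. The function names and the sample are hypothetical, and it assumes the stray characters are plain single spaces.

```python
import re


def tolerant_pattern(word):
    """Compile a regex matching word with an optional space between letters."""
    return re.compile(" ?".join(re.escape(ch) for ch in word), re.IGNORECASE)


def count_matches(word, text):
    """Count occurrences of word, with or without the inserted spaces."""
    return len(tolerant_pattern(word).findall(text))


if __name__ == "__main__":
    # Hypothetical sample standing in for text extracted from the PDF.
    sample = "PUMP 1 OK\nP U M P 2 OK\npump 3\n"
    print(count_matches("pump", sample))
```

A single pass then counts both the normal and the spaced-out forms, so the two do not have to be searched for separately.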