-
1. Re: How can adding hyperlinks cause a 740,000% increase in the amount of "Structure info" data in a PDF?
Sabian Zildjian Aug 28, 2013 8:11 AM (in response to alanomaly)What kind of hyperlinks? Are they web links to external pages/pdfs/files? Or are they PDF bookmarks, Named Destinations, and TOC hyperlinks? The later types will take up considerably more data within the PDF file. The tagging of those types of links will add more data as well.
-
2. Re: How can adding hyperlinks cause a 740,000% increase in the amount of "Structure info" data in a PDF?
alanomaly Aug 28, 2013 9:53 AM (in response to Sabian Zildjian)They're all regular web links to http:// web pages, except one which is a http:// url to a downloadable PDF (for some reason on my machine this one opens in a different browser - Safari, while the others open in my default browser Chrome - I think Acrobat must have a different default for urls ending in .pdf).
-
3. Re: How can adding hyperlinks cause a 740,000% increase in the amount of "Structure info" data in a PDF?
Dave Merchant Aug 28, 2013 9:56 AM (in response to alanomaly)Without access to these two different files, there's no way anyone on here can tell what's going on.
-
4. Re: How can adding hyperlinks cause a 740,000% increase in the amount of "Structure info" data in a PDF?
alanomaly Aug 28, 2013 10:06 AM (in response to Dave Merchant)I'm not expecting a perfect magic answer to drop from the sky... but I'm sure people with an understanding of what this "Structure Info" is actually used for can share insight or suggest possibilities.
For example, I've done quite a lot more reading and research, and I stumbled on something that suggested the data the Audit tool calls "Structure info" corresponds to what the PDF Optimiser options call "document tags". That gives me an avenue of investigation. There were probably people on this forum who knew that off the top of their heads.
I've also learned that accessibility features can create complex tagging structures in a PDF, and that this is one known possible cause of bloated "Structure Info" - so I've started checking back through my workflow in case anything could have added any accessibility related tagging that I wasn't expecting. It might not be the cause, but it's a possibility and is something else that I'm sure some people here knew off the top of their heads.
-
5. Re: How can adding hyperlinks cause a 740,000% increase in the amount of "Structure info" data in a PDF?
Dave Merchant Aug 28, 2013 1:03 PM (in response to alanomaly)The 'structure' audit includes all the tagging and accessibility features, that's made clear in the help files. What you're asking is why this particular file is getting 1.5MB of extra stuff, and we can't possibly comment without seeing the file. You're evidently not comparing like with like, as the size of all the other audit blocks are completely different.
-
6. Re: How can adding hyperlinks cause a 740,000% increase in the amount of "Structure info" data in a PDF?
Test Screen Name Aug 28, 2013 1:55 PM (in response to Dave Merchant)The interesting thing is that you are clearly adding lots of JavaScript. This doesn't appear to be accounted. Maybe the structure info is including or comprising that. Just a thought.
However, I can't understand how the first file could possibly be tagged - tagging is a big overhead and can multiply the size of a text-only file.
-
7. Re: How can adding hyperlinks cause a 740,000% increase in the amount of "Structure info" data in a PDF?
alanomaly Aug 29, 2013 3:51 AM (in response to Dave Merchant)Test Screen Name: It looks like your hunch about the difference being that the first doc is untagged is right - and it looks like tagging is the root of the problem.
I've been back over the process, making sure that I wasn't allowing InDesign to include document tags this time, and the amount of structure info is back down to normal and Acrobat is no longer struggling to cope with the documents.
Why the hyperlinks cause so much additional tagging, I don't know. What the difference is under the hood between hyperlinks in a tagged and non-tagged PDF, I don't know (all I know about hyperlinks under the hood is that they can be implemented as javascript or named destinations, according to your comment on my other thread). Not sure how to find out more - this is beyond the level of detail that normal manual docs or guides go into. But it looks like I'm on the right track now, so maybe I don't need to.
---------
Dave Merchant: before giving people the "RTFM" treatment, it's polite to read the manual yourself, here's the entirity of Adobe's online help page about the Acrobat space audit feature:
Audit the space usage of a PDF
Note the absence of any sentence stating anything like " 'Structure info' refers mainly to document tags, which are used for things including defining reading order of text elements for screen reading, SEO and other automatic processing. This can add a lot of weight to a document, especially if the document is intended to be fully accessible". I learned that from other (more helpful) forum threads I found on loosely related topics.
The only search results for "structure info" in the adobe.com Acrobat docs help section are two PDFs referring to a seemingly unrelated concept of the same name in the context of the Acrobat API and SDK.
I still don't know what else other than tagging comes under the heading "Structure info" (if anything else does), but I do now know one thing about what "Structure info" might mean, and that one thing is proving very useful.





