I have been struggling for a while now with the following workflow:
Batch exporting all text (text only) as plain text from multiple InDesign files, applying <em> and <strong> to all the bold and italic text (for instance: Molluptas verion <strong>nossum</strong> idist <em>doluptatet</em> maiorerum quiaspienit, cum erferiosapis eos expe nonsequas verumquae dolor sim eos doluptatiur autet lab idicili beatum deliquat).
Properly tagging the styles would be too time-consuming: these documents date back as far as 1998 and have inconsistent, untagged styles. It would mean manually opening each file and assigning tags to all styles (not to mention that there might be some local overrides).
This task is necessary to make the whole archive available on a WordPress website.
Any feedback would be much appreciated.
What exactly do you want us to suggest? If you want this to work, you’ll need to set your files up correctly, and that means using character styles that are properly mapped to tags and then exporting the HTML out of InDesign.
It sounds like you need a script.
A script could cycle through your text looking for text styled with italic, oblique, bookitalic, etc., to cover all the possible font attributes, and tag the text.
Someone in the ID scripting forum may be able to help you write this.
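To make the idea concrete, here is a minimal sketch of the tagging logic only, written in Python. It is not an InDesign script: a real one would most likely be written in InDesign's own scripting language and would have to read the text and font style of each run from the document. Here the runs are assumed to be already extracted as (text, font style) pairs, and the names ITALIC_HINTS, BOLD_HINTS, tags_for and runs_to_html are made up for illustration.

```python
from html import escape

# Assumption: keywords that mark a font style name as italic or bold.
# Substring matching means "Book Italic" and "BookItalic" both count as italic,
# and "Semibold" counts as bold; adjust the lists to taste.
ITALIC_HINTS = ("italic", "oblique")
BOLD_HINTS = ("bold",)


def tags_for(font_style):
    """Return the HTML tags implied by a font style name, outermost first."""
    name = font_style.lower()
    tags = []
    if any(hint in name for hint in BOLD_HINTS):
        tags.append("strong")
    if any(hint in name for hint in ITALIC_HINTS):
        tags.append("em")
    return tags


def runs_to_html(runs):
    """Turn (text, font_style) pairs into plain text with <em>/<strong> tags."""
    parts = []
    for text, font_style in runs:
        if not text:
            continue
        html = escape(text, quote=False)  # protect &, < and > in the body text
        # Wrap from the innermost tag outwards.
        for tag in reversed(tags_for(font_style)):
            html = f"<{tag}>{html}</{tag}>"
        parts.append(html)
    return "".join(parts)


if __name__ == "__main__":
    sample = [
        ("Molluptas verion ", "Regular"),
        ("nossum", "Bold"),
        (" idist ", "Regular"),
        ("doluptatet", "Book Italic"),
        (" maiorerum", "Regular"),
    ]
    print(runs_to_html(sample))
```

Because the check is done on the font style name rather than on a character style, it would also pick up local overrides, which is the case that style-to-tag mapping misses.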
Thanks for your input, Bob. As mentioned, setting the files up properly basically means opening each of them and doing this work. I have to do this for the whole archive (about 3000 InDesign and QuarkXPress files). By my rough estimate, this work would take about 750 hours.
I was hoping for a miracle script or something. I refuse to think I have to do this by hand.
There may well be a script, or you may have to pay someone to write it. All I can tell you is that any text not mapped to a style will be a mess in the HTML code.
Can’t comment on Quark files. I have no clue what they’re capable of.
Thanks for your suggestion, Bob.
Unfortunately, it is more complicated than that, as a few sets of fonts are being used (and they come with different font styles such as italic, pro italic, condensed italic, oblique, etc.).
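One way to keep the variety of style names manageable is to audit them first: collect every font style name that occurs in the archive and review how each distinct name would be classified, rather than reviewing files one by one. The Python sketch below assumes the style names have already been gathered into a list; the functions classify and audit and the keyword rules are hypothetical, not part of any InDesign or QuarkXPress tool.

```python
from collections import Counter


def classify(font_style):
    """Return (is_bold, is_italic) for a font style name.

    Assumption: a name is italic if it contains "italic" or "oblique",
    and bold if it contains "bold", ignoring case, spaces and hyphens.
    """
    name = font_style.lower().replace(" ", "").replace("-", "")
    is_bold = "bold" in name
    is_italic = "italic" in name or "oblique" in name
    return is_bold, is_italic


def audit(font_styles):
    """Summarise distinct style names as (name, count, is_bold, is_italic)."""
    counts = Counter(font_styles)
    report = []
    for name, count in counts.most_common():
        is_bold, is_italic = classify(name)
        report.append((name, count, is_bold, is_italic))
    return report


if __name__ == "__main__":
    seen = ["Italic", "Pro Italic", "Condensed Italic", "Oblique", "Regular", "Italic"]
    for name, count, is_bold, is_italic in audit(seen):
        print(f"{name:20} x{count}  bold={is_bold}  italic={is_italic}")
```

The report is short even for a large archive, because it lists each style name once. Any name the rules get wrong can then be handled with an extra keyword or an explicit exception.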