We recently did some tests with PDF to ePub export (different settings, different tools). What we saw is that none of the tools were able to respect paragraph and text styles. They just threw some text back to the screen (ePub Reader). This brought us to the philosophical discussion whether it is possible to export / add character and paragraph style information in a PDF, as we know it from e.g. Adobe InDesign (via the content stream, through metadata) so "a" tool can access that information to use it for exporting (properly) to ePub or another file format. So is there a PDF-standard or 'variant' (PDF/X, PDF/VT, PDF/A, ISO 32000 as such) that would make such a construction possible? And if so, would 'a' tool (e.g. PDF to ePub conversion tool) be able access that information to translate the style info etc. to another file format (e.g. a proper e-Pub file). BTW: we tried 'Create tagged PDF' out of Adobe InDesign, but that didn't help the outcome very much.
Thanks in advance!
Yes, it is quite possible and in fact Adobe InDesign will/can produced Tagged PDF that incorporates this information.
Of course, that means that tools needs to respect it when they read/convert the PDF. to my knowledge, NONE of the 3rd party PDF conversion tools support PDF tags/structure ☹.