Skip navigation
Jeff_Parker
Currently Being Moderated

dc:title in PDF file from Word 2010 - where does it come from?

Jul 5, 2012 11:12 PM

Tags: #acrobat_pro_x #word_2010 #conversion_problem

Hi, I have Office 2010 and Acrobat X Pro (all up to date) and am creating a PDF file from inside Word using PDFMaker 10.1.

 

In the resultant PDF file, the Document Properties shows in Description a Title: of "Terms Of Reference", which is not anywhere in the Word file, or from what I can see anywhere in its metadata.

 

By going, in the PDF file, to Additional Metadata..., then to Advanced, I see that the Dublin Core Properties dc:title element has "Terms Of Reference" as the value for [x-default].

 

My question is where does this value come from? Is it somewhere in the Word doc or somewhere in my system?

 

And how can I get it so that the actual Title of the Word document is put there?

 

Any advice and assistance would be greatly appreciated!

 

Thanks and kind regards, JP

 
Replies
  • Currently Being Moderated
    Jul 6, 2012 3:55 AM   in reply to Jeff_Parker

    If you're saving to Adobe PDF and have the "Convert document information" option ticked on the PDFMaker settings dialog, then the metadata for the PDF reflects whatever Word has defined in its document properties panel (via the prepare menu or right-click a file in Windows Explorer and choose Properties>Summary). If you untick the option to convert the document info, PDFMaker should leave everything blank aside from the basics (creating software ID, dates, etc.).

     

    Printing to PDF is a slightly-different matter, in that case the PDF title is generated using the filename and application that started the print job.

     
    |
    Mark as:
  • Currently Being Moderated
    Jul 7, 2012 6:35 AM   in reply to Jeff_Parker

    dc:title (which equates to the Title field on the PDF document properties) should be collected from the Title attribute entered in Word - I've not seen a case where it's collected by PDFMaker from anywhere else. There may be something strange going on with the Word (DOCX) file in this case, or with the installation.

     

    Try inspecting the raw code of the DOCX file:

     

    • Rename it to a .ZIP extension (that's what it actually is)
    • Open the ZIP file and extract the tree of XML files to a temporary folder
    • Search the docProps/core.xml for your wayward title

     

    If you convert other DOCX files to PDF, so the titles behave properly?

     
    |
    Mark as:
  • Currently Being Moderated
    Jul 9, 2012 12:55 AM   in reply to Jeff_Parker

    OK, now it makes sense.

     

    If an OOXML file uses sections, the <TitlesOfParts> tag holds the names for each one. Because in your file there's only one section, Acrobat is reading the section name in preference to the overall title of the file - as you're in effect only printing one section.

     

    See http://msdn.microsoft.com/en-us/library/windows/desktop/aa380374%28v=v s.85%29.aspx for help with the tag names.

     
    |
    Mark as:

More Like This

  • Retrieving data ...

Bookmarked By (0)

Answers + Points = Status

  • 10 points awarded for Correct Answers
  • 5 points awarded for Helpful Answers
  • 10,000+ points
  • 1,001-10,000 points
  • 501-1,000 points
  • 5-500 points