Hi, I have Office 2010 and Acrobat X Pro (all up to date) and am creating a PDF file from inside Word using PDFMaker 10.1.
In the resultant PDF file, the Document Properties shows in Description a Title: of "Terms Of Reference", which is not anywhere in the Word file, or from what I can see anywhere in its metadata.
By going, in the PDF file, to Additional Metadata..., then to Advanced, I see that the Dublin Core Properties dc:title element has "Terms Of Reference" as the value for [x-default].
My question is where does this value come from? Is it somewhere in the Word doc or somewhere in my system?
And how can I get it so that the actual Title of the Word document is put there?
Any advice and assistance would be greatly appreciated!
Thanks and kind regards, JP
If you're saving to Adobe PDF and have the "Convert document information" option ticked on the PDFMaker settings dialog, then the metadata for the PDF reflects whatever Word has defined in its document properties panel (via the prepare menu or right-click a file in Windows Explorer and choose Properties>Summary). If you untick the option to convert the document info, PDFMaker should leave everything blank aside from the basics (creating software ID, dates, etc.).
Printing to PDF is a slightly-different matter, in that case the PDF title is generated using the filename and application that started the print job.
Hi Dave, many thanks. I realise all that and have put the data in the Word fields that I want. But the Title in the resultant PDF is not from any document metadata that I have entered, or from the metadata that Windows has with the file (at least that is visible via Windows Explorer). I want to know where Acrobat gets the data for dc:title from, so then I might find it and change it.
dc:title (which equates to the Title field on the PDF document properties) should be collected from the Title attribute entered in Word - I've not seen a case where it's collected by PDFMaker from anywhere else. There may be something strange going on with the Word (DOCX) file in this case, or with the installation.
Try inspecting the raw code of the DOCX file:
If you convert other DOCX files to PDF, so the titles behave properly?
Hi Dave, many thanks. I did as you suggested and found that the file called app.xml in the folder docProps had this content:
<?xml version="1.0" encoding="UTF-8" standalone="true"?>
-<Properties xmlns:vt="http://schemas.openxmlformats.org/officeDocument/2006/docPropsVTypes" xmlns="Normal.dotm808401390379252Microsoft" _mce_href="http://schemas.openxmlformats.org/officeDocument/2006/exte nded-properties"><Template>Normal.dotm</Template><TotalTime>808</Total Time><Pages>40</Pages><Words>13903</Words><Characters>79252</Character s><Application>Microsoft">http://schemas.openxmlformats.org/officeDocu ment/2006/extended-properties"><Template>Normal.dotm</Template><TotalT ime>808</TotalTime><Pages>40</Pages><Words>13903</Words><Characters>79 252</Characters><Application>Microsoft Office Word</Application><DocSecurity>0</DocSecurity><Lines>660</Lines><Para graphs>185</Paragraphs><ScaleCrop>false</ScaleCrop>-<HeadingPairs>-<vt :vector baseType="variant" size="2">-<vt:variant><vt:lpstr>Title</vt:lpstr></vt:variant>-<vt:var iant><vt:i4>1</vt:i4></vt:variant></vt:vector></HeadingPairs>-<TitlesOfParts>-<vt:vector baseType="lpstr" size="1"><vt:lpstr>Terms Of Reference</vt:lpstr></vt:vector></TitlesOfParts><Manager>[deidentified]</Manager><Company>[deidentified]</Company><Lin ksUpToDate>false</LinksUpToDate><CharactersWithSpaces>92970</Character sWithSpaces><SharedDoc>false</SharedDoc><HyperlinksChanged>false</Hype rlinksChanged><AppVersion>14.0000</AppVersion></Properties>
Unfortunately the nice formatting disappeared when I pasted in here. I've bolded the section above that includes the errant "Terms of Reference" title. Am not sure what this all means and how the vt:lpstr value becomes the doc's title when the PDF is created.
Interestingly, the file called core.xml in the same folder had a dc:title specified with the document's correct name, i.e. as I had entered it in Word's Document Properties.
I also found the following in a file called header1.xml in the folder called word:
-<w:fldSimple w:instr=" TITLE "Terms Of Reference" ">-<w:r w:rsidR="00DF3DFE"><w:t>Terms Of Reference</w:t></w:r></w:fldSimple>
I'm pretty sure these are the only instances where "Terms of Reference" appear in the zip file.
In answer to your last question, I created a new Word doc from a report template I use, specified a Title in the Document Properties (i.e. in the Summary tab), did a Create PDF inside Word, and the correct title appears in the resultant PDF file.
So it appears as though "Terms of Reference" is inside my Word doc somewhere that I can't see, and also is stored there in a way that Acrobat prefers to use that for the doc's Title instead of where it is specified in a way that you'd reasonably expect it should be used, i.e. in the Document Properties.
Thanks again Dave for your assistance with this. Much appreciated.
Kind regards, JP
OK, now it makes sense.
If an OOXML file uses sections, the <TitlesOfParts> tag holds the names for each one. Because in your file there's only one section, Acrobat is reading the section name in preference to the overall title of the file - as you're in effect only printing one section.
See http://msdn.microsoft.com/en-us/library/windows/desktop/aa380374%28v=v s.85%29.aspx for help with the tag names.
Hi Dave, I'm glad it makes sense to you! I wasn't aware that sections had names. Do you know how I can edit the name of the one section in my doc, in Word?
Or, is there a way to edit the XML? and not destroy the document... If so, any guidance on how to do this please, and so that Acrobat will pick up the Title I specified in the Document Properties, i.e. not by replacing "Terms of Reference" with the text I want, which would hard code the value and not solve the problem?
Europe, Middle East and Africa