This is not possible with Adobe Acrobat.
Thanks Bernd for your quick response.
I have another question: does Adobe Acrobat Pro have a command-line utility or an API for converting PDF to HTML?
For the API look at the Acrobat SDK.
I checked the Acrobat DC SDK and was unable to find any API for saving a PDF as HTML.
Any pointers to the API method, or to the class I should look into, would be helpful.
For a few sample PDFs the conversion produces HTML with proper formatting, but for others the PDF formatting is lost, i.e., images and text are not properly aligned in the HTML output.
You can view the sample PDF and its output at:
Below is the VB.net code I'm using for conversion:
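' Open the source PDF through the Acrobat PDDoc automation object
Dim srcDoc As Acrobat.CAcroPDDoc = CreateObject("AcroExch.PDDoc")
srcDoc.Open(sPDFPath)
' Get the JavaScript bridge object and save the document as HTML
Dim jsObj As Object = srcDoc.GetJSObject()
jsObj.saveAs(sHtmlPath, "com.adobe.acrobat.html")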
Am I missing something? Is there any property or attribute of the JSObject that I need to set to get accurately formatted HTML output?
There is no such attribute or property.