39 Replies Latest reply on Mar 23, 2011 10:11 AM by Peter Grainge

    RoboHelp HTML 9 frequently crashes

    Chrissy_1234

      Hi,

       

      I'm using the latest version of RoboHelp 9. While doing the most rudimentary tasks in my project, like formatting text using the Design window and editing HTML code, RoboHelp will either hang or crash unexpectedly. This is happening at least five times a day recently. Many of the pages incidentally contain Word HTML tags and other junk code from Word that I remove manually--don't know if that has to do with the crashing. I can't figure out what's going on.

       

      My computer's specs are the following:

      Pentium Dual-Core CPU 2.5 GHz

      4GB installed memory

      Windows 7 Ultimate 64-bit OS

       

      Any feedback would be much appreciated.

       

      Thanks,

      Chris

        • 1. Re: RoboHelp HTML 9 frequently crashes
          Peter Grainge Adobe Community Professional (Moderator)

          Try simply deleting the CPD file so that RoboHelp creates a new when you open the project.

           

           

           


          See www.grainge.org for RoboHelp and Authoring tips

           

          @petergrainge

          • 2. Re: RoboHelp HTML 9 frequently crashes
            Captiv8r Adobe Community Professional & MVP

            Welcome to our community, Chris

             

            Where are your project files? Are you working on them while they are stored on a network drive? If so, that likely explains the crashes. Project files should only ever be edited while they are stored on local media such as your C drive. If they are on a network, you are engaging in a conversation that is causing issues.

             

            Cheers... Rick

             

             

            Helpful and Handy Links

            RoboHelp Wish Form/Bug Reporting Form

            Begin learning RoboHelp HTML 7, 8 or 9 within the day!

            Adobe Certified RoboHelp HTML Training

            SorcerStone Blog

            RoboHelp eBooks

            • 3. Re: RoboHelp HTML 9 frequently crashes
              Mona girdhar

              Hi Chris,

               

              Can you please let us know if Robohelp is crashing for this particular project or Robohelp Crashes even when you try to edit a new project.

               

              Thanks

              Mona

              • 4. Re: RoboHelp HTML 9 frequently crashes
                RoboAsh Level 2

                I will just repeat one of the question

                 

                1. Are you working on a project placed on a network location (then copy it locally and work on the local copy) - this seems to be the most likely cause in your case!

                 

                2. How many topics are there in your project

                3. How many topics are opened in the RoboHelp editor (there should not be more that 20-25 topics open at a time in the Design editor)

                 

                The redundant code pertaining to word has a least possibilities of causing the crash

                 

                Ashish

                • 5. Re: RoboHelp HTML 9 frequently crashes
                  Chrissy_1234 Level 1

                  Hi everyone,

                   

                  My project as well as the program are running on the C drive. The project is contained in the default location--Documents, My RoboHelp Projects. Nothing related to my project resides on the network.

                   

                  There are 76 topics in my project.

                   

                  This is the first main project I am working on with this version of RoboHelp. I have no more than five topics open at any given time in the editor, but more often than not only one topic is open. The crashes happen if I have one page open or multiple ones, the number of topics that are opened at any given time doesn't seem to matter. And if I remember correctly the program crashes even if there is no Word HTML code contained in the file.

                   

                  I will try deleting the file that one of the other respondents suggested and see what happens, then post again here.

                   

                  Thanks.

                  • 6. Re: RoboHelp HTML 9 frequently crashes
                    Chrissy_1234 Level 1

                    Hi, Peter,

                     

                    I deleted the file as you suggested, but it did not solve the problem. At the time I was editing a topic -- a "basic" HTML page riddled with Word HTML and other erroneous Word code. While simply deleting all that nonsensical and irrelevant code, RoboHelp crashed -- this has become a normal thing. And I don't want to find my project corrupted one day because of these crashes. 

                     

                    I don't know if Adobe support is monitoring these forums, but can't anyone there simply replicate what I'm doing and see if they get the same results using my computer specs? The HTML files created in Word incidentally may or may not be saved using that "filtered" option by the way, it doesn't matter, RoboHelp will crash either way. Perhaps it's an application conflict with 64-bit Windows 7?

                    • 7. Re: RoboHelp HTML 9 frequently crashes
                      RoboAsh Level 2

                      Did you imported these Word files to generate the topics, and the HTML code cleaning you are talking about should be done very cautiaosly because if the backend HTML of the topic is incorrect then topic parsing can also cause the crashers.

                       

                      As we see that the file missing symbols are showing up so the deletion of .cpd file will not help as the .fpj is also incorrect itself.

                       

                      Can you please zip and share on the physical folders (HTML files with other resourses and .fpj file) @ ashishchandra.gupta@gmail.com then, If there are some particular topics which crash RoboHelp with higher probabitity then do share it

                       

                      Ashish

                      • 8. Re: RoboHelp HTML 9 frequently crashes
                        Chrissy_1234 Level 1

                        Hi Ashish,

                         

                        The HTML files that had been created in Word were imported into the project as HTML files; they were not imported as .doc files that were converted to XHTML by RoboHelp. I have been very careful cleaning the code, though.

                         

                        Unfortunately I cannot send any of the project files to you because the information contained in the project is considered confidential information by the company I work for.

                         

                        Thanks,

                        Chris

                        • 9. Re: RoboHelp HTML 9 frequently crashes
                          Mona girdhar Level 1

                          Hi Chris,

                           

                          Instead of importing the HTML files created from the Word, Import the word document directly in RoboHelp. The Import in Robohelp will automatically clean up the extra code. Hope this will resolve your problem.

                           

                          Thanks,

                          Mona

                          • 10. Re: RoboHelp HTML 9 frequently crashes
                            Chrissy_1234 Level 1

                            Hi Mona,

                             

                            Unfortunately, that's not possible as I do not have access to the original .doc files, only the HTML files created by others have been made available to me. I have to work with what I already have.

                             

                            Thanks,

                            Chris

                            • 11. Re: RoboHelp HTML 9 frequently crashes
                              Mona girdhar Level 1

                              Hi Chris,

                               

                              If you can't send the HTML Files. Can you please do the following:-

                               

                              1. Replace your Confidential content with some Dummy Content in the HTML File

                              2. Import that HTML file in Robohelp

                              3. Try Editing this in Robohelp

                               

                              If Robohelp still crashes, please send me the HTML file at monagirdhar@gmail.com . I think that the issue is coming because of some HTML structure.

                               

                              Thanks,

                              Mona

                              • 12. Re: RoboHelp HTML 9 frequently crashes
                                MergeThis Level 4

                                Chris, unfortunately, HTML conversion as practiced by Microsoft is a train wreck. We were struggling with a series of "HTML" files that had been "converted" from XLS spreadsheets. When instructed to generate WebHelp output from these files, RoboHelp gagged and sputtered for long periods of time, coughed up a few hairballs, and eventually (reluctantly) produced our output. It was touch-and-go, at times. Part of the problem is that MS has produced, in its "conversion," what it feels is needed: an output file for web viewing. RoboHelp, on the other hand, expects files similar to its own source files, which it then prepares for output. When it encounters some foreign, already-prepared-output code, well, there's the rub!

                                 

                                We "solved" our problem by renaming those XLS-to-HTML files to .html instead of .htm, and adding them to Baggage Files. This structure allowed RoboHelp to simply pass them along, ungenerated, to the output. Since we were not using the default RoboHelp search (using Zoom), we were still able to include these files in search.

                                 

                                Since you're presented with less options, I might suggest the nuclear option: that is, do your own filtering of this MS mish-mash by "cleansing" the content through a pure text editor such as Notepad.

                                 

                                1. Open the Word "HTML" file, Select All, Copy.
                                2. Open Notepad, Paste.
                                3. In Notepad, Select All, Copy.
                                4. Open the RH topic (new or existing) in Design View, empty any content, and Paste from Notepad.
                                5. In the RH topic, set your formatting, styling, links, etc.

                                 

                                If these files are continuously updated and dumped on you for inclusion, you might want to get a good Compare utility (I use ExamDiff Pro) to identify only the changes, which you then manually add to your topics (without ever allowing them anywhere near your project!).

                                 

                                Try it with a few problematic files, as a test. It might sound like a lot of work, but that initial "cleansing" is a one-off effort, and updates would be a breeze (or a treat, as some say).

                                 

                                 

                                Good luck,

                                Leon

                                • 13. Re: RoboHelp HTML 9 frequently crashes
                                  Chrissy_1234 Level 1

                                  Hi Leon,

                                   

                                  Your suggestion did not do anything except copy and paste the code from one application to another unchanged.

                                   

                                  It seems there is no other solution other than going into the HTML code and deleting the junk that Word left in there. I found that some tags and attributes, especially the Span and the SpellE, GramE and so forth can be removed in the Design editor using the right click, Remove Attributes option, but still it's not an elegant solution. Or else the junk can be removed by using the Replace option in the HTML editor. But all the Word style attributes and junk at the top of the code has to be removed by hand it seems. And as a result of selecting and deleting that stuff, RoboHelp crashes, which is the problem I have.

                                   

                                  Adobe developers should figure out a way to implement a "Clean HTML" feature to delete residual junk from files created in Word or any other word processor for that matter saved as HTML, or else figure out a way where the program doesn't crash when people do it themselves. I have tried such cleaning apps available online but none really get the job done.

                                   

                                  Anyway, thanks.

                                   

                                  Chris

                                  • 14. Re: RoboHelp HTML 9 frequently crashes
                                    Willam van Weelden Adobe Community Professional & MVP

                                    Hi,

                                     

                                    When you create a web page from Word, be sure to choose to save as "HTML, filtered". This will result in a much cleaner (though still polluted) file. If you choose the regular HTML, Word crams the page with Word code.

                                     

                                    For cleaning word HTML, try searching google for "clean Word html" and that will give you some nice hits for cleaning the word html. Also, Dreamweaver CS5 has a utility to clean up Word generated HTML. Last but not least, you could also write a script in RoboHelp to clean the code of a html document and import it into the project. The easiest way out may be to simply import the Word document itself in RoboHelp.

                                     

                                    Greet,

                                     

                                    Willam

                                    • 15. Re: RoboHelp HTML 9 frequently crashes
                                      Mona girdhar Level 1

                                      Hi Chris,

                                       

                                      If you can't send the HTML Files. Can you please do the following:-

                                       

                                      1. Replace your Confidential content with some Dummy Content in the HTML File

                                      2. Import that HTML file in Robohelp

                                      3. Try Editing this in Robohelp

                                       

                                      If Robohelp still crashes, please send me the HTML file at monagirdhar@gmail.com . I think that the issue is coming because of some HTML structure.

                                       

                                      Thanks,

                                      Mona

                                      • 16. Re: RoboHelp HTML 9 frequently crashes
                                        MergeThis Level 4

                                        In step 1, you did open the Word "HTML" file in a browser, didn't you?

                                         

                                        If so, I can't understand how any underlying code could have been transferred between steps 1 and 2.

                                         

                                         

                                        Leon

                                        • 17. Re: RoboHelp HTML 9 frequently crashes
                                          A25CharacterScreenName

                                          This sounds like a job for HTML Tidy (http://tidy.sourceforge.net). Tidy is a command-line program, so you need to be comfortable with launching a program from a command window, but not only is it designed to clean invalid content, it has a specific option to clean M$ Word files (--word-2000 yes).

                                           

                                          RoboHelp actually uses Tidy when you select "W3C Compliance" from the "File" menu (libtidy.dll is stored in C:\Program Files\Adobe\Adobe RoboHelp 8\RoboHTML). I haven't yet tried setting the environment variable to see if I can change RoboHelp's default Tidy configuration, but will do so when I get the chance, and report back.

                                           

                                          If you have trouble figuring out how to use Tidy, check back and I will try to respond to any specific questions you may have.

                                          • 18. Re: RoboHelp HTML 9 frequently crashes
                                            Chrissy_1234 Level 1

                                            Does anyone know if there is a performance conflict with Windows 7 64-bit? RoboHelp is crashing now when I am editing simple HTML code. It crashed on me three times in two hours. Is anyone else in the world having these issues with RoboHelp 9 running on Windows 64-bit or is it just me?

                                             

                                            Thanks.

                                            • 19. Re: RoboHelp HTML 9 frequently crashes
                                              Peter Grainge Adobe Community Professional (Moderator)

                                              Chrissy

                                               

                                              First it may help you re sending files to know that both RoboAsh and Mona are Adobe employees who are giving their time to support the forums.

                                               

                                              Second, as A25... says, HTML Tidy is a good way of cleaning the Word "HTML". Do the topics contain meta tags describing the source document and the fact that it was created in Word? If they do, delete those tags. See Importing Word Docs on my site for more information about the process. It  hasn't been updated for 9 but the process is the same.

                                               


                                              See www.grainge.org for RoboHelp and Authoring tips

                                               

                                              @petergrainge

                                              • 20. Re: RoboHelp HTML 9 frequently crashes
                                                Chrissy_1234 Level 1

                                                Hi Peter,

                                                 

                                                I had been removing all the Microsoft-related meta tags and declarations at the top where they appeared. I used HTML Tidy before on the web site they have --- http://infohound.net/tidy/ -- copying and pasting the code there and it always reports zero errors. Again, I cannot import the original Word files because I don't have them, I'm simply working with the HTML files that were created in Word, and I can't send them to anyone to examine for reasons of confidentiality.

                                                 

                                                Thanks for your help,

                                                Chris

                                                • 21. Re: RoboHelp HTML 9 frequently crashes
                                                  Peter Grainge Adobe Community Professional (Moderator)

                                                  It would be nice to see a topic but I doubt that is the problem. However you say that is not possible.

                                                   

                                                  I think it is something else and clearly so does RoboAsh, an Adobe employee. However you say that is not possible.

                                                   

                                                  I really don't know what to suggest as we are in the situation where you car is broken but you will not let the mechanic see it.

                                                   

                                                  I'm not meaning to sound awkward, just wanting to explain the difficulty of helping you as we would wish.

                                                   


                                                  See www.grainge.org for RoboHelp and Authoring tips

                                                   

                                                  @petergrainge

                                                  • 22. Re: RoboHelp HTML 9 frequently crashes
                                                    Chrissy_1234 Level 1

                                                    Look, it's not hard to recreate this problem. Nearly all of the files were saved without the filtered option. Some of them have lenghty tables. All of them have Word styles converted into HTML. If anyone has a .doc file that is more than five or so pages long--the longer the better--save it as an HTML file using Word 2007 or higher. Then import it into RoboHelp. If RoboHelp doesn't crash as you edit the code and delete all the junk, then perhaps there is something wrong with my files, but I doubt that because I finally managed to clean them all despite the frequent crashes, yet RoboHelp still crashes if I open the page and make other minor adjustments, like deleting tags I don't want.

                                                     

                                                    I can't send any of the files, I am a technical writer working for a software company. If I could I would. They're not going to like that I send confidential files to anyone. So if the mechanic can't help, then that's fine. I will keep putting up with the crashes.

                                                     

                                                    Thanks,

                                                    Chris

                                                    • 23. Re: RoboHelp HTML 9 frequently crashes
                                                      Peter Grainge Adobe Community Professional (Moderator)

                                                      Chrissy

                                                       

                                                      I did make it clear that I understood your situation. The difficulty here is that others are not reporting this problem so either it is your documents or your project. Without access to those it simply isn't possible to help and I think from the number of replies it is clear people do want to help.

                                                       

                                                      If you cannot send the files or the project even to Adobe, then it does become difficult to help. I can understand not sending them to some unknown person but sending them to Adobe is surely different? Can you not get permission?

                                                       

                                                      The only other thing I can suggest is that you create a new project and import one of the problem topics into that. Does the problem continue?

                                                       


                                                      See www.grainge.org for RoboHelp and Authoring tips

                                                       

                                                      @petergrainge

                                                      • 24. Re: RoboHelp HTML 9 frequently crashes
                                                        A25CharacterScreenName Level 1

                                                        I've had similar problems with RoboHelp 8 -- usually when I edit the HTML by hand and create a not-well-formed file. If you're editing HTML by hand you must remember to promptly remove the correct end tag when you remove a start tag.  RoboHelp is not very good at coping with errors in HTML files, so I would do this in an editor other than RoboHelp, and re-validate the well-formedness of the file before opening it again in RoboHelp. Try installing Microsoft's Visual Web Developer 2010 Express, which is free with registration.

                                                         

                                                        Just checking the validity of the MSHTM files with Tidy is not good enough; you need to actually use it to strip the MS cruft. I suggest you go to tidy.sourceforge.net and actually download the program so you can run it yourself.

                                                         

                                                        Once you have it, open a command prompt and navigate to the folder where the MS files are stored. Then you can execute Tidy something like this:

                                                         

                                                        "c:\program files\tidy.exe" --word-2000 yes -o [name of the output file]  [name of the input file]


                                                        The --word-2000 options tells Tidy to strip all the Word cruft; the -o option says to use the following file name as the output file (if you don't specify an output file Tidy writes the output to "Standard out" -- usually the screen, but it can be redirected if need be). Then, import the tidied file that was created into RoboHelp, not the Word file. I'm not sure that this will solve your problem but I think there is a good chance it will, and at the very least you will end up with much cleaner files to work with.

                                                         

                                                        If you start using a lot of Tidy options its useful to collect them into a single configuration file, both to reduce the need for typing and to ensure correctness. The configuration file is passed to Tidy on the command line using the -config option. In this case your command line would look something like:

                                                         

                                                        "c:\program files\tidy.exe" -config myconfig.cfg -o [name of the output file]  [name of the input file]

                                                         

                                                        My own tidy configuration file, which I have placed in the root of my c: drive where I can always find it, is as follows:

                                                         

                                                        break-before-br: y
                                                        drop-font-tags: y
                                                        clean: n
                                                        force-output: y
                                                        indent: y
                                                        indent-spaces: 2
                                                        numeric-entities: y
                                                        output-xhtml: y
                                                        output-encoding: ascii
                                                        input-encoding: win1252
                                                        tidy-mark: n
                                                        replace-color: y
                                                        merge-divs: n
                                                        merge-spans: n
                                                        wrap: 85

                                                         

                                                        I have created a simple batch file to tidy a file and create an output file having yet another .html extension. It is as follows:

                                                         

                                                        "%ProgramFiles%\tidy.exe" -config c:\tidy.cfg -o %1.html %1
                                                        pause

                                                         

                                                        I then associated this batch file with the ".htm" and ".html" extension in the Windows file explorer (Tools->Folder Options->File Types->Advanced) as a new Action named "Tidy". Now when I right-click on a file I can select "Tidy" and I will end up with a new, tidied file. If you were to do this same thing, I would use ".htm" instead of ".html" as the added output extension, as I find that RoboHelp likes to use that extension better.

                                                         

                                                        Should you have any further questions, don't hesitate to ask.

                                                        1 person found this helpful
                                                        • 25. Re: RoboHelp HTML 9 frequently crashes
                                                          Chrissy_1234 Level 1

                                                          Hi, A25... I will try that out. Thanks a lot for your help. That will most likely work.

                                                           

                                                          Regards,

                                                          Chris

                                                          • 26. Re: RoboHelp HTML 9 frequently crashes
                                                            Chrissy_1234 Level 1

                                                            The problem is deleting the MS Word 2007 style tags. I don't know if Word 2000 tags and 2007 tags are the same, I believe they aren't. So I cannot delete the following types of text automatically using TidyHTML:

                                                             

                                                               <w:WrapTextWithPunct/>

                                                             

                                                               <w:UseAsianBreakRules/>

                                                             

                                                               <w:DontGrowAutofit/>

                                                             

                                                               <w:SplitPgBreakAndParaMark/>

                                                             

                                                               <w:DontVertAlignCellWithSp/>

                                                             

                                                               <w:DontBreakConstrainedForcedTables/>

                                                             

                                                               <w:DontVertAlignInTxbx/>

                                                             

                                                               <w:Word11KerningPairs/>

                                                             

                                                               <w:CachedColBalance/>

                                                             

                                                              </w:Compatibility>

                                                            ...

                                                             

                                                            which continues seemingly forever, then:

                                                             

                                                             

                                                            <style>

                                                             

                                                            <!--

                                                             

                                                            /* Font Definitions */

                                                             

                                                            @font-face

                                                             

                                                                {font-family:"Cambria Math";

                                                             

                                                                panose-1:2 4 5 3 5 4 6 3 2 4;}

                                                             

                                                            /* Style Definitions */

                                                             

                                                            p.MsoNormal, li.MsoNormal, div.MsoNormal

                                                             

                                                                {margin:0in;

                                                             

                                                                margin-bottom:.0001pt;

                                                             

                                                                font-size:12.0pt;

                                                             

                                                                font-family:"Times New Roman","serif";}

                                                             

                                                            a:link, span.MsoHyperlink

                                                             

                                                                {color:blue;

                                                             

                                                                text-decoration:underline;}

                                                             

                                                            a:visited, span.MsoHyperlinkFollowed

                                                             

                                                                {color:purple;

                                                             

                                                                text-decoration:underline;}

                                                             

                                                            .MsoChpDefault

                                                             

                                                                {font-size:10.0pt;}

                                                             

                                                            @page Section1

                                                             

                                                                {size:8.5in 11.0in;

                                                             

                                                                margin:1.0in 1.25in 1.0in 1.25in;}

                                                             

                                                            div.Section1

                                                             

                                                                {page:Section1;}

                                                             

                                                            -->

                                                             

                                                            </style>

                                                             

                                                             

                                                            and this:

                                                             

                                                            <p class=MsoNormal style='margin-left:1.0in;text-indent:-1.0in'>

                                                             

                                                            and this:

                                                             

                                                            <span class=SpellE><span class=GramE>

                                                             

                                                            This is the junk I've been talking about, and TidyHTML will not get rid of it. I tried the TidyHTML web site, I tried the Tidy command line tool and other Tidy tools available with a GUI. It does not strip out this junk. And when I try to do it in RoboHelp, it crashes. I don't know what else I can write to better explain what I mean.

                                                             

                                                            Thanks,

                                                            Chris

                                                             

                                                             

                                                            • 27. Re: RoboHelp HTML 9 frequently crashes
                                                              Peter Grainge Adobe Community Professional (Moderator)

                                                              At the risk of sounding like a stuck record, can you not change the content of a document so that the junk remains and I can try to find a solution?

                                                               


                                                              See www.grainge.org for RoboHelp and Authoring tips

                                                               

                                                              @petergrainge

                                                              • 28. Re: RoboHelp HTML 9 frequently crashes
                                                                Chrissy_1234 Level 1

                                                                Visual Web Developer 2010 Express will not install on my system for some unexplained reason.

                                                                • 29. Re: RoboHelp HTML 9 frequently crashes
                                                                  Chrissy_1234 Level 1

                                                                  Of course. Thank you.

                                                                  • 30. Re: RoboHelp HTML 9 frequently crashes
                                                                    Peter Grainge Adobe Community Professional (Moderator)

                                                                    Good. Send it via my site when you are ready. As it will just be a document, ignore the instructions and attach it to the email.

                                                                     


                                                                    See www.grainge.org for RoboHelp and Authoring tips

                                                                     

                                                                    @petergrainge

                                                                    • 31. Re: RoboHelp HTML 9 frequently crashes
                                                                      Peter Grainge Adobe Community Professional (Moderator)

                                                                      A lot of the junk was removed by using Save As and selecting the Web Page - Filtered option instead of just Web Page.

                                                                       

                                                                      Then saving that as a DOCX file and importing that gave better results, no MsoNormal for instance.

                                                                       

                                                                      The remaining problems look to be related to the way the document was created, lots of tabs and spaces used instead of using a table.

                                                                       

                                                                      Chrissy is checking my findings.

                                                                       


                                                                      See www.grainge.org for RoboHelp and Authoring tips

                                                                       

                                                                      @petergrainge

                                                                      • 32. Re: RoboHelp HTML 9 frequently crashes
                                                                        Mona girdhar Level 1

                                                                        Hi Chrissy,

                                                                         

                                                                        Can you also send us the file in which you have changed the content, so that we can also look into this issue.

                                                                         

                                                                        If possible please send me the HTML file at monagirdhar@gmail.com

                                                                         

                                                                        Thanks

                                                                        Mona

                                                                        • 33. Re: RoboHelp HTML 9 frequently crashes
                                                                          A25CharacterScreenName Level 1

                                                                          Chrissy_1234 wrote:

                                                                          The problem is deleting the MS Word 2007 style tags. I don't know if Word 2000 tags and 2007 tags are the same, I believe they aren't. So I cannot delete the following types of text automatically using TidyHTML:

                                                                          [Examples snipped]

                                                                          I took the liberty of looking a little deeper into this for you. As others here have suggested, saving the files from M$Word as "filtered" HTML might help you get past your problem. I know you don't have access to the original M$Word documents, but that probably isn't a problem--you see, the reason that M$Word embedds all that junk into the HTML file is because it thinks you're going to want to re-open the file in Word, and not lose any of the word-specific information. If you have access to M$Word you could simply open the saved HTML in Word, then resave it "filtered". Of course, even Micro$oft "filtered" output still contains more junk that what you want, and for reasons that I once knew but have since forgotten, Tidy does a better job cleaning the unfiltered output than it does the filtered output.

                                                                           

                                                                          Examining the M$Word output, however, I noticed that most of the junk was embedded in comment tags (<!-- -->). Tidy has the option of removing comment blocks: "--hide-comments yes". So I took a simple 7 page M$Word document and saved it as HTML from Word 2007. I then added these two lines to my "tidy.cfg" file:

                                                                           

                                                                          hide-comments: y

                                                                          word-2000: y

                                                                           

                                                                          After running Tidy on my text document it was very, very clean, and had been reduced in size by about 2/3. All of the elements from the "w:", "m:", and "o:" namespaces were removed, as were all of M$Word's excessive use of the "style" attribute. My build of Tidy is from December 2008.

                                                                           

                                                                          I would suggest you try Tidy again on your original M$Word .htm files using the foregoing configuration options.This might get you over the hump; if not, let me know and I'll see if I can make some changes to Tidy that will get rid of what is still bothering you.

                                                                           

                                                                          (If, after importing clean HTML RoboHelp is still crashing on you, I can't offer any further help; at that point it's clearly a RoboHelp problem, not an HTML problem).

                                                                          • 34. Re: RoboHelp HTML 9 frequently crashes
                                                                            Chrissy_1234 Level 1

                                                                            Hi A25...

                                                                             

                                                                            It seems your solution is the best one for cleaning the Word code and style tags from the HTML files. I used Tidy UI since it's much simpler for me, implementing all of your config parameters that you mentioned in your responses. The result was pristine HTML. I simply copied the tidied HTML from Tidy UI and pasted it into RoboHelp's HTML editor. RoboHelp automatically added its own meta tags and made sure it was XHTML compliant. Then I applied my style sheet and that was it.

                                                                             

                                                                            But the crashing continues. This crashing problem is most definitely related to the HTML editor--whenever "heavy" code editing is being done (now it's simply copying and pasting code), the application seizes. If RoboHelp doesn't exit automatically, I routinely have to go into the Task Manager and force it to quit. So I really think Adobe support needs to open a bug and look into this crashing problem because given the price tag of this application, it's really unacceptable.

                                                                             

                                                                            Thanks very much for everyone's help. It seems we cannot do anything about the crashing; that's apparently Adobe's problem to resolve.

                                                                             

                                                                            Regards,

                                                                            Chris

                                                                            • 35. Re: RoboHelp HTML 9 frequently crashes
                                                                              Peter Grainge Adobe Community Professional (Moderator)

                                                                              I think there is still something to try. Next time a topic causes the crash, import that topic into a new project and see if it still causes a crash. That would confirm or reject the theory that I think A25... shares that this could be your project.

                                                                               

                                                                               


                                                                              See www.grainge.org for RoboHelp and Authoring tips

                                                                               

                                                                              @petergrainge

                                                                              • 36. Re: RoboHelp HTML 9 frequently crashes
                                                                                Jeff_Coatsworth Adobe Community Professional & MVP

                                                                                Also, could the bug (http://kb2.adobe.com/cps/892/cpsid_89293.html) that affects CBT and flipping  back & forth to Design View be a cause of Chrissy_1234's crashing? Would it hurt her to try it out?

                                                                                • 37. Re: RoboHelp HTML 9 frequently crashes
                                                                                  Peter Grainge Adobe Community Professional (Moderator)

                                                                                  I don't think CBTs are involved and the crashing occurs whilst in HTML rather than when switching. Nonetheless you are right it is worth a try, stranger things happen at sea!

                                                                                   


                                                                                  See www.grainge.org for RoboHelp and Authoring tips

                                                                                   

                                                                                  @petergrainge

                                                                                  • 38. Re: RoboHelp HTML 9 frequently crashes
                                                                                    A25CharacterScreenName Level 1

                                                                                    Chrissy_1234 wrote:

                                                                                     

                                                                                    [snip]

                                                                                    But the crashing continues. This crashing problem is most definitely related to the HTML editor--whenever "heavy" code editing is being done (now it's simply copying and pasting code), the application seizes.

                                                                                     

                                                                                    As I mentioned earlier, I have seen this kind of behavior with RoboHelp 8--always after I have edited the HTML code by hand. RoboHelp does not crash, it just freezes up and has to be killed from the task manager. It hasn't bothered me to the point where I have taken the time to do the kind of testing needed to actually pinpoint the problem; I figure that that's a job for Adobe's engineers. I don't know what the internals are, but it seems to be best to let RoboHelp figure out the HTML, and stick with the design view for editing, and use the HTML view for only minor tweaks (RoboHelp, like most WYSIWYG editors seriously abuses the <p> tag).

                                                                                     

                                                                                    What I would suggest is rather than trying the cut-and-paste method you let Tidy actually build a new HTML file, then import the new file into RoboHelp (File->Import->HTML/XHTML File). It may not be necessary, but you might want to exit RoboHelp after the import is complete before attempting to edit the file. Thereafter, try to limit the changes you make in the HTML editor. The HTML that RoboHelp generates is not great, but RoboHelp is the tool you've chosen to use, so you kind of have to live with it.

                                                                                    • 39. Re: RoboHelp HTML 9 frequently crashes
                                                                                      Peter Grainge Adobe Community Professional (Moderator)

                                                                                      A25...

                                                                                       

                                                                                      Clearly you have an interest in this problem so would you please try a simple test in one of your projects to see if you get the same result.

                                                                                       

                                                                                      Save either the original Word document or the HTML version of it using Web Page - Filtered in Word. Then save that HTM file as DOCX and import that Word document. Chris' file had lots of indents in the P tag and lots of spaces but that was a product of the way the document was created. In every other respect the code was better than any other method I have seen.

                                                                                       

                                                                                      It would really help to know that you get the same result and assuming you do, maybe it will simplify things for you.

                                                                                       


                                                                                      See www.grainge.org for RoboHelp and Authoring tips

                                                                                       

                                                                                      @petergrainge