2 Replies Latest reply on Sep 9, 2012 12:10 PM by jmkicap

    Thai script line breaking in CS6

    jmkicap

      Hello,

       

      I am having serious difficulty finding a way to implement acceptable line breaking in a long document I am working in Thai script. I am working with InDesign CS6, using the World-Ready Composer. I have tried a variety of differnet Thai fonts. The text I am flowing into the document has already been marked (by the author) with a large number of additional acceptable break locations (which I have converted to different types of spaces). The text still breaks randomly in the middle of strings (and also at the space locations -- but not seemingly with any priority).

       

      I have tried using the standard Paragraph Composer as well. Some line spacing changes, and cursor behaviour over some strings is different, but the bad line breaks still happen.

       

      I have also tried the same text in CS5 and CS4. Same problem.

       

      The Language setting for styles is set to 'No Language'. I was certainly hopeful that the extra breaks marked in the text with thin or zero width spaces would provide enough line break flexibility. Does not seem so.

       

      Would anyone here have any advice, or other (better) experience working with Thai?

       

      Thanks

       

      Jeff

        • 1. Re: Thai script line breaking in CS6
          Joel Cherney Adobe Community Professional & MVP

          The text I am flowing into the document has already been marked (by the author) with a large number of additional acceptable break locations (which I have converted to different types of spaces). The text still breaks randomly in the middle of strings (and also at the space locations -- but not seemingly with any priority).

           

          The "priority" is based off of no rhyme or reason that I can detect. When I process Lao or Cambodian in InDesign (which I don't expect to break correctly, in any circumstance, unlike Thai), I just use a GREP style that applies No Break to characters between any space, zero-width or otherwise. I'm assuming that the Thai author used zero-width space, but it could have been some other marker, I suppose. If you don't have any experience w/GREP, just tell us what the different types of spaces are where breaks are permissible and we can gin up a GREP query for you to use.

          1 person found this helpful
          • 2. Re: Thai script line breaking in CS6
            jmkicap Level 1

            Joel,

             

            Thanks very much for your reply. This was quite helpful. I was able to wrap all runs of Thai script letter characters with no break. I am actually importing InDesign Tagged Text into InDesign. I used the following regular expression to search and replace in the tagged text files.

             

            Search: ([\u0E01-\u0E5B]+)

            Replace: <cNoBreak:1>\1<cNoBreak:>

             

            Using a GREP style in InDesign is an easier solution. Unfortunately there is a chance in the text I am working with that the resulting style will become nested within a piece of text already wrapped with a character style, so the local noBreak instruction is the only reliable way.

             

            The World Ready Composer has added so much needed support for the OpenType font features required for complex script rendering. Much appreciated! I do hope in future versions of InDesign that support for these South Asian languages will continue to improve by adding the necessary text (morphology) analysis needed to implement real support for line breaking. (The characters in Thai belong to the Unicode line breaking class "SA", which indicates that software needs to provide this added support -- such as a dictionary lookup, or other analysis, to indicate which characters should be allowed to break after)

             

            It feels like InDesign (at this point) is just treating all characters in these Thai text runs as an acceptable breaking location, and doing its best to stick close to the prefered justification word and letter spacing values.

             

            Thanks, again for the helpful idea.

             

            Jeff