I am importing text from a document which has chinese. I need to remove all of the chinese text. Any advice with what script to use? Thanks
Andy
Well, that's cool. I'd never seen the kanji marker before. Didn't even know it was there, I've been specifying Unicode ranges in GREP queries to achieve a similar effect.
Does it really remove everything? Let me check... Yikes. This query deletes every line in which Chinese or Japanese glyphs occur. Seems like overkill to me. If you want to delete any paragraph, story, footnote or cell (thx Jongware for WhatTheGrep) that contains any Chinese text whatsoever, even if it's just one glyph in the middle of an entire English paragraph, then this GREP will work for you. Otherwise, maybe some additional explanation of what you need (did someone give you an uncleaned bilingual Trados file?) might be in order. Or you could tweak the GREP to suit, if you're into that sort of thing.
I agree with Joel that it's a shame to call Chinese characters by the Japanese term for "Chinese characters." Of course, it is also a shame that Unicode has sanctified use of "CJK Ideographs" although many students of Chinese culture would prefer "logographs": from their very beginning, most Chinese characters were not "pictures," as Chinese scholars recognized long, long ago.
As for the GREP metacharacter "~K", I guess it goes back to IDCS3 (for search only, not replace) and was paired with "^K" in IDCS4 Text searches. Who knew? I wish I did. Maybe David Blatner did but chose not to mention it in his October 2009 InDesign Secrets piece on using GREP to find Japanese with Unicode text ranges. The interesting point there, of course, is that the ranges are discontinuous.
Is there a way to find out just what ranges "~K" covers? A quick test suggests in IDCS4, "~K" doesn't find the 42,711 characters added in CJK Extension B in Unicode 3.1 (2001) -- though you can find them with "[\x{20000}-\x{2A6DF}]+".
David
Thankyou for your very insightful and thorough answers.
Firstly, I am not trying to translate, which is why the first grep is extremely helpful.
There is both an english and chinese version of this book, and I am to design the english version (with exclusive english only.)
The text I have been given included both languages, hence my need to find a solution to effectively remove the chinese text.
Thanks again,
Andy
North America
Europe, Middle East and Africa
Asia Pacific