* setup2.sgml (setup-locale-ov): Document CJK ambiguous width change

for non-Unicode charsets.
	* new-features.sgml (ov-new1.7.8): Mention CJK ambiguous width change.
	Drop redundant NT4 pre-SP4 support entry.
This commit is contained in:
Corinna Vinschen 2010-11-18 11:05:46 +00:00
parent 617dc68bfe
commit 1570248800
3 changed files with 21 additions and 12 deletions

View File

@ -1,3 +1,10 @@
2010-10-18 Andy Koppe <andy.koppe@gmail.com>
* setup2.sgml (setup-locale-ov): Document CJK ambiguous width change
for non-Unicode charsets.
* new-features.sgml (ov-new1.7.8): Mention CJK ambiguous width change.
Drop redundant NT4 pre-SP4 support entry.
2010-10-09 Corinna Vinschen <corinna@vinschen.de> 2010-10-09 Corinna Vinschen <corinna@vinschen.de>
* new-features.sgml (ov-new1.7.8): Document reinstantiation of the * new-features.sgml (ov-new1.7.8): Document reinstantiation of the

View File

@ -35,7 +35,10 @@ Drop support for Windows NT4 prior to Service Pack 4.
</para></listitem> </para></listitem>
<listitem><para> <listitem><para>
Drop support for Windows NT4 prior to Service Pack 4. Fix the width of "CJK Ambiguous Width" characters to 1 for singlebyte charsets
and 2 for East Asian multibyte charsets. (For UTF-8, it remains dependent on
the specified language, and the "@cjknarrow" locale modifier can still be used
to force width 1.)
</para></listitem> </para></listitem>
</itemizedlist> </itemizedlist>

View File

@ -353,17 +353,16 @@ With the "@cyrillic" modifier it's UTF-8.
<listitem><para> <listitem><para>
There's a class of characters in the Unicode character set, called the There's a class of characters in the Unicode character set, called the
"CJK Ambiguous Width Character set". For these characters the width "CJK Ambiguous Width" characters. For these characters, the width
returned by the wcwidth/wcswidth function is usually 1. This is often a returned by the wcwidth/wcswidth functions is usually 1. This can be a
problem in East-Asian languages, which historically use character sets problem with East-Asian languages, which historically use character sets
in which these characters have a width of 2. By default, the where these characters have a width of 2. Therefore, wcwidth/wcswidth
wcwidth/wcswidth functions return 1 as the width of these characters, return 2 as the width of these characters when an East-Asian charset such
except if the language is specifed as "ja" (Japanese), "ko" (Korean), or as GBK or SJIS is selected, or when UTF-8 is selected and the language is
"zh" (Chinese). In these languages wcwidth and wcswidth return 2 for specified as "zh" (Chinese), "ja" (Japanese), or "ko" (Korean). This is
these characters. This is not correct in all circumstances, so the user not correct in all circumstances, hence the locale modifier "@cjknarrow"
of one of these languages can specify the modifier "@cjknarrow", which can be used to force wcwidth/wcswidth to return 1 for the ambiguous width
modifies the behaviour of wcwidth/wcswidth to return 1 for the ambiguous characters.
width characters.
</para></listitem> </para></listitem>
</itemizedlist> </itemizedlist>