* pathnames.sgml (pathnames-unusual): Talk about using UTF-8 in C
locale. * setup2.sgml (setup-locale-problems): Ditto.
This commit is contained in:
parent
b6c6ea43f3
commit
40c66067fb
|
@ -1,3 +1,9 @@
|
|||
2009-05-13 Corinna Vinschen <corinna@vinschen.de>
|
||||
|
||||
* pathnames.sgml (pathnames-unusual): Talk about using UTF-8 in C
|
||||
locale.
|
||||
* setup2.sgml (setup-locale-problems): Ditto.
|
||||
|
||||
2009-05-06 Corinna Vinschen <corinna@vinschen.de>
|
||||
|
||||
* faq-setup.xml: Fix entry explaing how the homedir is evaluated
|
||||
|
|
|
@ -368,6 +368,11 @@ filename because the question mark will not translate back to the original
|
|||
Chinese character, but to a simple question mark instead. This in turn
|
||||
results in strange "File not found" messages.</para>
|
||||
|
||||
<note><para>In the default "C" locale, Cygwin creates filenames using
|
||||
the UTF-8 charset. This will always result in some valid filename by
|
||||
default, but again might impose problems when switching to a non-"C"
|
||||
or non-"UTF-8" charset.</para></note>
|
||||
|
||||
<note><para>To avoid this scenario altogether, always use UTF-8 as the
|
||||
character set.</para></note>
|
||||
|
||||
|
|
|
@ -317,12 +317,15 @@ variable hasn't been set <emphasis>before</emphasis> starting this process,
|
|||
Cygwin has to make an educated guess which charset to use to convert
|
||||
the environment itself. The only reproducible way to do that in the absence
|
||||
of <envar>LC_ALL</envar>, <envar>LC_CTYPE</envar>, or <envar>LANG</envar>,
|
||||
is to use the current Windows ANSI codepage.</para>
|
||||
is to use the "C" locale. The default conversion in the "C" locale
|
||||
used by Cygwin internally is UTF-8. So, in the absence of any
|
||||
internationalization environment variable, the environment will be converted
|
||||
to UTF-8.</para>
|
||||
|
||||
<para>As long as the environment only contains ASCII characters, this is
|
||||
no problem. But if it contains native characters, and you're planning
|
||||
to use, say, UTF-8, the environment will result in invalid characters in
|
||||
the UTF-8 charset. This would be especially a problem in variables like
|
||||
no problem at all. But if it contains native characters, and you're planning
|
||||
to use, say, GBK, the environment will result in invalid characters in
|
||||
the GBK charset. This would be especially a problem in variables like
|
||||
<envar>PATH</envar>.</para>
|
||||
|
||||
<note><para>Per POSIX, the name of an environment variable should only
|
||||
|
|
Loading…
Reference in New Issue