[PATCH] Also treat "East Asian Ambiguous" characters as full-width

Benoit Boissinot benoit.boissinot at ens-lyon.org
Fri Oct 30 06:34:57 CDT 2009


On Fri, Oct 30, 2009 at 11:46:52AM +0900, Shun-ichi Goto wrote:
> # HG changeset patch
> # User Shun-ichi GOTO <shunichi.goto at gmail.com>
> # Date 1256870784 -32400
> # Node ID ba57125215c3a2adc760e98668e523a13f647dac
> # Parent  3c30ae2d6f1bc0b74c87ea46b695e59485f79414
> Also treat "East Asian Ambiguous" characters as full-width.
> 
> "East Asian Ambiguous" characters like 'GREEK SMALL LETTER BETA'
> (U+03B2) or 'MULTIPLICATION SIGN' (U+00D7) should be counted as
> full-width because it depends on the context.
> 
> See also:
>  "Unicode Standard Annex #11 - East Asian Width"
>  http://www.unicode.org/reports/tr11/tr11-14.html#Ambiguous

It says:
"""
When processing or displaying data
[...]
* Ambiguous characters behave like wide or narrow characters depending
  on context (language tag, script identification, associated font,
  source of data, or explicit markup; all can provide the context). If
  the context cannot be established reliably they should be treated as
  narrow characters by default.
"""

But we have no context here, right? So shouldn't we treat them as
narrow?

regards,

Benoit

-- 
:wq


More information about the Mercurial-devel mailing list