RFC: safe pattern matching for problematic encoding

Matt Mackall mpm at selenic.com
Fri May 25 13:52:00 CDT 2012


On Fri, 2012-05-25 at 12:03 +0200, Martin Geisler wrote:
> FUJIWARA Katsunori <foozy at lares.dti.ne.jp> writes:

> Okay, I can see why there might be some problems there. But for 99.9% of
> the cases I think Python's Unicode support is okay. Things that breaks
> must be pretty obscure, right? In those cases I would tell users that
> their filename isn't supported.

Try to decode an NFD string into any byte encoding other than UTF-8. Not
even Python 3 does this right. Hurray for Unicode.

-- 
Mathematics is the supreme nostalgia of our time.




More information about the Mercurial-devel mailing list