Encoding

Matt Mackall mpm at selenic.com
Mon Dec 4 01:00:32 UTC 2006


I've taken my own stab at tackling the charset encoding issue. 
Here's a repo with the patches:

http://www.selenic.com/repo/hgencode/

This manages encoding for usernames and descriptions in changelogs,
tags, and branch names. Encoding in hgweb is also taken care of.

The detected charset can be seen with hg help -v:

    --encoding        set the charset encoding (default: UTF-8)
    --encodingmode    set the charset encoding mode (default: strict)

And those options can be used to override it (as well as environment
variables HGENCODING and HGENCODINGMODE). More on charset conversion here:

http://www.selenic.com/mercurial/wiki/index.cgi/ChangelogEncodingPlan

I plan to push this to mainline in the next couple days for the 0.9.2
release, so test away.

I spent a while wrestling with the Windows multiple charset issues and
my eventual decision was that we couldn't win and it wasn't really
worth fighting. Details here:

http://www.selenic.com/mercurial/wiki/index.cgi/Character_Encoding_On_Windows

-- 
Mathematics is the supreme nostalgia of our time.


More information about the Mercurial-devel mailing list