German umlauts in file names

Matt Mackall mpm at selenic.com
Thu Jun 19 11:55:33 CDT 2008


On Thu, 2008-06-19 at 14:07 +0200, Hans Meine wrote:
> Am Donnerstag, 19. Juni 2008 13:49:18 schrieb Marko Kaening:
> > How do I find out which charset hg or THg is actually using?
> 
> In my experience, there is often no such thing as "the charset XYZ uses" 
> nowadays.  Think of it more like which charset your file content/name is 
> encoded at, and every application on every platform must agree on that.
> 
> In your case,
> a) what is the encoding of the SVN repo's content?
> b) Is it the same whether the SVN repo is created on Linux or Windows?
> c) What is the encoding of the converted hg repo's content?
> d) Is it the same whether you ran "hg convert" on Linux or Windows?
> 
> If all repos use e.g. UTF-8 (which would be very sane), the only thing to be 
> checked is whether strings provided via the UI are correctly transformed 
> to/from that charset.
> 
> I don't know about THg, but hg is a python program, for which the encoding 
> used for filenames is sys.getfilesystemencoding(),

Mercurial by design does absolutely no encoding on filenames, as
filenames very often have to byte-for-byte agree with their
representation in other files such as makefiles, etc.

-- 
Mathematics is the supreme nostalgia of our time.



More information about the Mercurial mailing list