UTF-16 in Mercurial

Augie Fackler lists at durin42.com
Mon Mar 8 12:56:59 CST 2010


2010/3/6 Martin Geisler <mg at lazybytes.net>:
> Benoît Allard <benoit at aeteurope.nl> writes:
>
>> Hi there,
>>
>> I've been experimenting on Windows with some UTF-16 (so called UNICODE
>> under Windows) config files (registry export to be complete) and the
>> attached -very- little extension that tries to make UTF-16 (or UTF-32)
>> seen as text (not binary).
>>
>> It has the drawback of generating non consistent patches: the body of
>> the patch being in the encoding of the file, and the metadata (@@,
>> +++, ...) being in ANSI.
>>
>> In one word, it's a dead end. [...]
>
> Thanks for testing it.
>
> How does Subversion and other systems cope with these files?

Subversion doesn't. It assumes you're dealing with ASCII (for
diffing), and if you're not, the only option is to use an external
diffing tool that groks the extended format.

>
> --
> Martin Geisler
>
> Fast and powerful revision control: http://mercurial.selenic.com/
>
> _______________________________________________
> Mercurial-devel mailing list
> Mercurial-devel at selenic.com
> http://selenic.com/mailman/listinfo/mercurial-devel
>
>


More information about the Mercurial-devel mailing list