Unicode support in log messages and file names

Benoit Boissinot bboissin at gmail.com
Sat Nov 11 11:42:22 CST 2006


switching to -devel

On 11/11/06, Andrey <grooz-work at gorodok.net> wrote:
> > As you have seen in http://www.selenic.com/mercurial/bts/issue156
> > I always got distracted by other topics for too many months now.
> >
> > Feel free to provide something for encoding log messages, but
> > encoding file names is a different topic and needs discussion.
> >
>
> Yes, encoding messages seems much more easy to implement. I'm planning to do
> it like this:
> - decode user provided log messages from locale encodeded byte strings to
> unicode strings;
> - use terminal encoding to display those messages (provided as unicode
> strings);
> - encode log messages (provided as unicode string) in UTF-8 when storing;
> - decode log messages from UTF-8 to unicode string when retrieving;
> - (the most complex part) make sure switching from byte strings to unicode
> strings does not break thinks.
>
> Is this list OK?

Can you add it to the wiki for future reference ?
Otherwise it sounds sane to me, you will have to make sure it doesn't
have a big performance cost (the i18n plan was deferred because of
that).

regards,

Benoit


More information about the Mercurial-devel mailing list