[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: [RFC/PATCH] commit messages not 8-bit compatible

From: Greg Hudson <ghudson_at_MIT.EDU>
Date: 2002-05-30 15:00:38 CEST

On Wed, 2002-05-29 at 11:50, Karl Fogel wrote:
> He is sending UTF-8, and that's stimulating the bug. So recoding all
> commit messages as UTF-8 isn't going to help, right?

No, he's sending ISO-8859-1 or something. UTF-8 would use two-byte (or
longer) sequences for the funny characters.

> But anyway, how are we going to "recode" commit messages to UTF-8, if
> we don't know what encoding they're coming from?

Marcus's approach is to require that LC_CTYPE be set to the encoding
your tools use to write and display funny characters. (If people used
tools which wrote and displayed UTF-8, that would be ideal, and we would
never have to do any conversion. Unfortunately, there are a limited
number of tools which do so right now.)

> And what format are
> you storing them in in the revision property?

In the revision property, they would be UTF-8.

> (i.e., What
> circumstances are included in "when you want to look at them"?)

When you do "svn log", your client would get the log message back from
the library in UTF-8 and would convert to your local character set to
display it. Similarly if you use "svn propget".

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Sat Jun 1 14:34:19 2002

This is an archived mail posted to the Subversion Dev mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.