[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: Call For Votes: converting log messages to UTF-8

From: Karl Fogel <kfogel_at_newton.ch.collab.net>
Date: 2002-05-31 20:32:58 CEST

Greg Stein <gstein@lyra.org> writes:
> 2) if you convert FOO characters, thinking they were BAR, then it will
> certainly be "funky", but you still won't have data loss -- convert back
> as if you had BAR.
>
> So. Option 1 is riskless in terms of data loss.

Huh? I don't think this is true.

The transformation can be lossy. For example, suppose you write your
log message in stateless encoding FOO (it may be fixed-width or not,
but it's not stateful). But Subversion mistakenly deduces from your
locale that it's in *stateful* encoding BAR. When it converts to
UTF-8, the (alleged) escape sequences of what svn took to be BAR will
be lost. You cannot get the original string back now.

Greg Hudson was saying something similar in his mail about JIS, I
believe.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Sat Jun 1 14:11:32 2002

This is an archived mail posted to the Subversion Dev mailing list.