[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: Call For Votes: converting log messages to UTF-8

From: Greg Stein <gstein_at_lyra.org>
Date: 2002-05-31 20:02:51 CEST

On Fri, May 31, 2002 at 01:46:41PM -0400, Greg Hudson wrote:
> On Fri, 2002-05-31 at 13:38, Greg Stein wrote:
> > Converting from charset FOO to UTF-8 is a specific translation. No data
> > loss. Converting from UTF-8 back to FOO is a perfect restoration.
>
> Hm, is this always true?
>
> For instance, a Shift-JIS document could have redundant shift octets.
> (Is that invalid? If so, does an iconv() from Shift-JIS to UTF-8
> actually enforce that?) Converting such a document to UTF-8 and back is
> presumably not an identity transformation on the octets. If the source
> document was not actually in Shift-JIS but was in some other character
> set, you could lose data.

Heh. Evil :-) Yup. Sure sounds like if the user had the wrong locale,
they could lose something. They'd need to go back and correct the log
message, then.

Cheers,
-g

-- 
Greg Stein, http://www.lyra.org/
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Sat Jun 1 14:11:42 2002

This is an archived mail posted to the Subversion Dev mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.