[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: use of UTF-8

From: Karl Fogel <kfogel_at_newton.ch.collab.net>
Date: 2002-05-31 16:37:11 CEST

Greg Hudson <ghudson@MIT.EDU> writes:
> On the other hand, there seems to be a fairly broad consensus for doing
> UTF-8/$LC_CTYPE character set conversion for filenames. I am...
> confused as to why anyone advocates converting filenames and not log
> messages, since they are both text. File contents are binary data.
> Property values... might be binary data; that seems to be the conensus
> for now, anyway, although that leads to questions about how svn:ignore
> should be interpreted and such. But log messages are definitely text.

Part of the justification is ease of implementation. We have to sling
filenames around all over the place internally, and write/read them in
xml files appx seventy times a second. It's just massively easier to
use `char *' UTF-8 for all that.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Sat Jun 1 14:12:25 2002

This is an archived mail posted to the Subversion Dev mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.