On Wed, May 29, 2002 at 07:45:57PM -0400, Greg Hudson wrote:
> On Wed, 2002-05-29 at 19:29, Greg Stein wrote:
> > Of course it applies to log messages. You want to be able to extract exactly
> > what you put in. That implies that a consistent and uniform character set is
> > chosen.
>
> > As I said elsewhere, we decided on UTF-8 for text for everything a long long
> > time ago. We wanted to absolutely avoid all this character set nonsense. So
> > picking *one* character set (which is theoretically a superset of all
> > others) is nice. It helps all users of the libraries.
>
> I can't tell whether you're advocating the approach taken by Marcus
> (where we translate from UTF-8 to the local character set whenever we
> interact with the system) or not.

Hrm. To clarify: I support Marcus' approach.

> We don't "avoid all this character
> set nonsense" if we do the translation, but not doing it means users'
> tools must all use UTF-8 (including all tools which interact with
> pathnames in the working directory).

We avoid it within the library and its APIs. For the data in the
repository to be useful, the repository must record or imply a character
set for each datum. If the charset were recorded per datum, the data would
have to carry it along as it moves out through the libraries, since one
user's charset might not match whatever was stored into the repos (the
recorded charset is what would make it possible to recode from the stored
chars to the user's chars).

To simplify the data storage and flow, we just say "it's all UTF-8". At the
boundaries between the SVN libraries and the client programs, the program
can (as appropriate) recode from UTF-8 to another charset.
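
A rough sketch of that boundary recoding, in Python for brevity rather
than the C the libraries are written in. The function names and the
"replace unrepresentable characters" policy are assumptions made for
illustration, not SVN APIs or decided behavior:

```python
# Hypothetical boundary helpers: everything inside the libraries is UTF-8;
# only the client program at the edge knows the user's charset.

def to_utf8(native: bytes, charset: str) -> bytes:
    """Recode bytes from the user's charset into UTF-8 on the way in."""
    return native.decode(charset).encode("utf-8")


def from_utf8(stored: bytes, charset: str) -> bytes:
    """Recode UTF-8 bytes into the user's charset on the way out.

    Characters the target charset cannot represent become '?'
    (an assumed policy; a real client might refuse instead).
    """
    return stored.decode("utf-8").encode(charset, errors="replace")


if __name__ == "__main__":
    log_message = "caf\u00e9".encode("latin-1")   # what a Latin-1 user typed
    stored = to_utf8(log_message, "latin-1")      # what the repository keeps
    print(stored)
    print(from_utf8(stored, "latin-1") == log_message)
```

A client whose charset can represent every stored character gets back
exactly what it put in; the lossy case only arises when the reader's
charset is narrower than the writer's.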

Tools are not going to be required to use UTF-8. Yes, it would be nice to
live in a UTF-8 world, but we've got more than enough problems to solve :-)

Cheers,
-g
--
Greg Stein, http://www.lyra.org/
Received on Sat Jun 1 14:35:34 2002