[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: Problems with accents in filenames

From: Vincent Lefevre <vincent+svn_at_vinc17.org>
Date: 2003-11-23 01:28:32 CET

On 2003-11-22 15:41:22 +0000, Philip Martin wrote:
> Let's go through this step by step as I had to make some assumptions
> about what you did:
> >>From an UXterm:
> What locale is being used? (I going to assume it's a UTF-8 one.)


> > $ svn add testé
> Do you have "test\xe9" or "test\xc3\xa9" on disk? (I'm going to
> assume it's "test\xc3\xa9".)

The UTF-8 sequence, as expected.

> > $ svn commit
> >
> >>From an Xterm, with an ISO-8859-1 locale:
> > $ svn st
> > ? testé
> > ! testé
> Is this the same working copy, but with a different locale?

Same working copy. And I'm using an ISO-8859-1 locale.

> (I assume it is, and that you have changed your locale from UTF-8 to
> 8859-1. The physical UTF-8 filename "test\xc3\xa9" no longer match
> the expected 8859-1 filename "test\xe9", hence the output above. I
> disagree with you that this is Subversion breaking.)

I haven't changed the files. So, this is a bug in Subversion. Any
program should know that the user may use different locales when
he wants. Locales are not something linked to the system itself,
but a user choice, local for each process (different users may
choose different locales, and different processes may run with
different locales, even for the same user).

> I don't understand your problem, if you want filenames to be UTF-8
> then set a UTF-8 locale.

No! I want an ISO-8859-1 locale because I want to work with this
character set.

> If you do that Subversion will not allow any names that aren't
> UTF-8. Are you seriously suggesting that "test\xe9", a perfectly
> valid filename, should cause a Subversion error?

No, but in my case, such a filename should never be created (using
the right programs).

> You can get what you want by setting a UTF-8 locale, why should your
> encoding be enforced on others?

This is not my encoding, but the one used by programs that will work
consistently whatever the locales are (very important in a multi-user

Vincent Lefèvre <vincent_at_vinc17.org> - Web: <http://www.vinc17.org/> - 100%
validated (X)HTML - Acorn Risc PC, Yellow Pig 17, Championnat International
des Jeux Mathématiques et Logiques, TETRHEX, etc.
Work: CR INRIA - computer arithmetic / SPACES project at LORIA
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Sun Nov 23 01:29:26 2003

This is an archived mail posted to the Subversion Dev mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.