[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: "Strange" characters in file names

From: Erik Huelsmann <ehuels_at_gmail.com>
Date: 2007-03-25 17:44:33 CEST

On 3/25/07, Per Jessen <per@computer.org> wrote:
> Erik Huelsmann wrote:
>
> > On 3/25/07, Per Jessen <per@computer.org> wrote:
> >> Ryan Schmidt wrote:
> >>
> >> > On Mar 24, 2007, at 12:36, <svn.users@salvisberg.com>
> >> >> The file name should be "Kurs Ernährung.doc", and it probably
> >> >> orginated from Windows.
> >> >>
> >> >
> >> > Perhaps you have not set the LANG variable so ls and svn don't know
> >> > how to properly display it. Try export LANG=de_DE.utf8 or whatever
> >> > the correct value for your OS is.
> >>
> >> That setting should not have any influence on how UTF8 characters are
> >> displayed. They are valid regardless of your locale settings.
> >
> > They may -however- not be valid Subversion input without that
> > setting...
>
> That is true, but I would call that a bug. The use or display of
> characters should not depend on which language subversion is set to
> use. If you have UTF-8 support, it's either or, not "mostly"
> or "often". IMHO anyway.

Sure. Subversion is always UTF-8, internally. Externally, it uses the
encoding specified for your system. When you give it input which isn't
encoded with the system encoding, it doesn't know how to recode it to
UTF8.

When the system encoding isn't specified, US-ASCII is assumed (as
defined in the POSIX/C standard). So, when you don't specify an
encoding (meaning ASCII) and feed it UTF8 accented characters, it
doesn't know how to recode: characters with the high bit set aren't
part of the US-ASCII definition...

bye,

Erik.
Received on Sun Mar 25 17:45:02 2007

This is an archived mail posted to the Subversion Users mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.