[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: Umlaut problem on Mac (composed vs. decomposed UTF-8)

From: Erik Huelsmann <ehuels_at_gmail.com>
Date: 2007-07-23 20:30:29 CEST

On 7/23/07, Thomas Singer <subversion@smartcvs.com> wrote:
> From science back to the core problems. IMHO following problems must be
> solved (not necessarily more, but definitely not less):

Right.

> 1) it should be possible to handle files with umlauts in its name without
> setting the "right" encoding

The resolution to this problem is currently under discussion on the
dev@apr.apache.org list (as far as MacOSX goes, other *nix are an
entirely different matter and unfortunately won't work without setting
the right encoding).

> 2) it should be possible to work on files with umlauts in its name on
> Windows/Linux as well as on Mac OS X, no matter on what platform they were
> added initially; on each platform the usual presentation must be used

Exactly. But your solution has a number of implied consequences. When
extending the Subversion design, we're used to hashing out all the
consequences and choose the solution which is the best trade-off of
all problems solved (and new ones created). We're in that phase now.

> Personally I don't care about some border-cases that exist in unicode, but I
> care about characters of - at least - the western hemisphere (umlauts,
> accents and so on). Let's find a way to solve that problem in SVN 1.5
> without making a science out of it.

You don't have to care about the border-cases in Unicode, but if we
don't want to introduce the same problem, but then in reverse (ie stop
working on Windows), we'll have to work through all problems and
solutions before actually doing any coding. This leaves enough
surprise problems to be resolved when we release the Release
Candidates.

> I'd suggest starting with a simple converter implementation like this:
>
> http://72.9.228.230/svn/jsvn/trunk/svnkit/src/org/tmatesoft/svn/core/internal/wc/SVNFileListUtil.java
>
> If it is not enough, one can add special character mappings later.

bye,

Erik.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Mon Jul 23 20:29:32 2007

This is an archived mail posted to the Subversion Dev mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.