
Re: Umlaut problem on Mac (composed vs. decomposed UTF-8)

From: Marc Haisenko <haisenko_at_comdasys.com>
Date: 2007-07-17 10:20:52 CEST

On Monday 16 July 2007 01:23, Branko Čibej wrote:
> What Subversion should do is not only know about the encoding of file
> names in the filesystem, but also the platform-specific normalization
> form. AIUI, Windows always uses normalization form C; Mac uses form D.
> Linux I've no idea about, but I suspect it uses form C.
> Internally, Subversion should normalize all Unicode strings, and I'd
> propose to use form C, since it's the most compact canonical representation.
> -- Brane

Having dealt with the composed/decomposed problem on Macs myself in a script
over the last few days, I totally agree that Subversion should handle
composed vs. decomposed file names. As far as I know, Mac OS X normalizes to
form D even if you feed it form C. So it would probably be logical to use
form C in the Subversion code, because we know it already behaves correctly
on at least Windows and Linux.
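
To illustrate the mismatch, here is a minimal Python sketch using the
standard unicodedata module. It is not Subversion code; the helper names
to_nfc and same_name and the sample file name are made up for this example.
It only shows that the two forms compare unequal as raw strings and compare
equal once both are normalized to form C.

```python
import unicodedata


def to_nfc(name: str) -> str:
    """Return the name in normalization form C (composed)."""
    return unicodedata.normalize("NFC", name)


def same_name(a: str, b: str) -> bool:
    """Compare two file names after normalizing both to form C."""
    return to_nfc(a) == to_nfc(b)


# Hypothetical file name, written once in each form.
composed = "M\u00fcnchen.txt"     # form C: u-umlaut as a single code point
decomposed = "Mu\u0308nchen.txt"  # form D: 'u' followed by a combining diaeresis

if __name__ == "__main__":
    print(composed == decomposed)            # raw comparison of the two forms
    print(same_name(composed, decomposed))   # comparison after normalizing
    print(len(composed.encode("utf-8")),
          len(decomposed.encode("utf-8")))   # byte length of each form
```

In this example the form C name is also one byte shorter in UTF-8, which
fits Brane's point about compactness.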

Marc Haisenko
Comdasys AG
Rüdesheimer Straße 7
D-80686 München
Tel:   +49 (0)89 - 548 433 321
e-mail: haisenko@comdasys.com
Received on Tue Jul 17 10:20:11 2007

This is an archived mail posted to the Subversion Dev mailing list.
