[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: Standardizing on UTF8 internally isn't enough

From: Daniel A. Steffen <das_at_users.sourceforge.net>
Date: 2007-07-18 00:44:47 CEST

On 18/07/2007, at 6:55, steveking wrote:

> Because the problem is clearly OS dependent. For example, "NTFS
> does no Unicode normalization at all"

as your example shows, normalization behavior is in fact not OS
dependent, but rather filesystem specific:
it is the HFS+ volume format that mandates storing filenames in a
specific fixed normalization form (which is not exactly NFD as per
UAX#15, e.g. some hangul chars are not decomposed [1]), other
filesystems on the mac do not have this requirement (e.g. NTFS), and
conversely, HFS+ volumes can be mounted on non-mac systems (e.g.
linux [2]) and expose its specific normalization behavior there...

[1] http://developer.apple.com/technotes/tn/
tn1150.html#CanonicalDecomposition
[2] http://www.ardistech.com/hfsplus/

Cheers,

Daniel

-- 
** Daniel A. Steffen                   **
** <mailto:das@users.sourceforge.net>  **
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Wed Jul 18 00:44:10 2007

This is an archived mail posted to the Subversion Dev mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.