[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: Standardizing on UTF8 internally isn't enough

From: Mark Phippard <markphip_at_gmail.com>
Date: 2007-07-19 14:22:56 CEST

On 7/19/07, Erik Huelsmann <ehuels@gmail.com> wrote:
> On 7/19/07, Justin Erenkrantz <justin@erenkrantz.com> wrote:
> > On 7/17/07, Erik Huelsmann <ehuels@gmail.com> wrote:
> > > This problem could be solved to by adding the ICU lib as a dependency
> > > and change all path comparisons to use the ICU normalform agnostic
> > > comparison routines.
> >
> > I am pretty much against the idea of adding a *required* 39MB (~10MB
> > gzip'd) library that isn't widely available as a requirement to run
> > SVN. Yes, Debian/Ubuntu has it as do some others; but it's not that
> > popular; quick look at the release notes indicates that ICU requires
> > VC2003+ for compilation - so does it actually support VC6 (listed as
> > "rarely tested")? We had enough Win32 people revolt against APR-iconv
> > - now, we're going to backtrack and add in a *huge* library to replace
> > it? Yikes. This is like squashing a fly with a sledgehammer, IMO.
> >
> > Why can't we just do input validation ourselves? This is clearly a
> > very specific corner case and one we can detect quite trivially (i.e.
> > look for the chars that only exist in normalization form D).
>
> Input validation isn't enough. We can't guarantee that all paths
> internally will be of the C form, if the admin isn't able to control
> the clients setups. (Older Mac clients will still be able to submit D
> form paths).
>
> mbk said that there's no reason in supplying backward compat for macs,
> since they are already broken, but the broken-ness only shows when
> working in a mixed Windows/Mac/Linux setup.

I am not against a better solution, but I still think a "good enough"
solution would be a big improvement.

> PS: The ICU library can be can be reduced in size to 1/10th if you
> only want normalization.
> PPS: Did people really revolt against apr-iconv because of its size?

No, it was not the size. Although that was a bonus. The problem is
that the library is poorly designed in terms of supporting multiple
applications using multiple versions or even built with different
compilers.

I'd like to see a real number on size. The ICU web site doesn't seem
to give sizes as big as what has been quoted here.

-- 
Thanks
Mark Phippard
http://markphip.blogspot.com/
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Thu Jul 19 14:22:03 2007

This is an archived mail posted to the Subversion Dev mailing list.