[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: using isalpha/isalnum in locale-independent code

From: Peter N. Lundblad <peter_at_famlundblad.se>
Date: 2004-11-27 13:30:47 CET

On Sat, 27 Nov 2004, [UTF-8] Branko ^Libej wrote:

> Peter N. Lundblad wrote:
>
> >:-) Is your stomach OK yet? OK, we need Unicode-aware functions, but in
> >svnserve, we need ASCII-specific tests.
> >
> Characer classification functions work on single bytes, and if they're
> to know the difference between the single-byte subset of UTF-8 and the
> multibyte sequences, they have to know something about UTF-8, so they
> should be called svn_utf_*. Wait for my upcoming commit/
>
If you only care about the ASCII subset, you can certainly say that
everything >= 0x80 is not part of that subset. If you create
classification function for UTF-8 you need to give them a string and
document them as working on the first *character*, not byge, of that
string. Else, we need an UCS-4 value instread. Awaiting your commit.

Best,
//Peter

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Sat Nov 27 13:20:26 2004

This is an archived mail posted to the Subversion Dev mailing list.