[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: [PATCH] stricter parsing of values in authz files

From: Jesper Steen Møller <jesper_at_selskabet.org>
Date: 2006-04-28 22:04:11 CEST

Peter N. Lundblad wrote:
> Garrett Rooney writes:
>
> > +#include <apr_lib.h> /* for apr_isspace() */
>
> Hmmm, which encoding is the authz file supposed to be in *shrug*. I
> ask because apr_isspace is locale-dependent. If it is UTF8, use
> svn_ctype.h instead.
>
As the a Chinese user a few weeks back didn't expect, the authz files
(and all other Subversion config files) are used raw as UTF-8 - mostly
untouched 8-bit byte sequences are copied into in-memory structures.
Also, on AS/400 the files always read as UTF-8 regardless of the CCSID
set on the file.

I guess this could be documented somewhere.

Also: On Windows clients, there is a charset mismatch between the byte
sequences read from config files (UTF8) and the values read from the
registry (using the RegXxxxA APIs), which are 8-bit ANSI codepage values.

-Jesper

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Fri Apr 28 22:01:24 2006

This is an archived mail posted to the Subversion Dev mailing list.