[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: Remaining l10n issues

From: Michael Sweet <mike_at_easysw.com>
Date: 2004-05-06 20:29:20 CEST

Peter N. Lundblad wrote:
> ...
> Yes, that's what I'm going to do. Problem is this:
> - svn_cmdline_printf() takes UTF-8 *strings* and other argument types
> - apr_psprintf will not touch the encoding of the strings, but it will
> ocnvert %d etc. to the locale's encoding (no, it won't, it converts it
> using numbers, . etc. from the C execution character set...)
> - When we are going to convert from UTF-8, we may have a mixed encoing
> string:-(
>
> Since the digits and the other characters that apr_psprintf will produce
> are in the ASCII (7-bit) range, they are the same for UTF-8 in ASCII-based
> encodings. So this shouldn't be a problem in most cases. But what happens
> on, i.e. EBCDIC systems? Anyone who knows?

If you use stdio s(n)printf, the local encoding system is used.
Current EBCDIC-based systems (i.e. AS/400), to my knowledge, provide
a compile-time option for using EBCDIC or ASCII as the base
character set. If you compile in EBCDIC mode, then the numbers
will be in EBCDIC. However, so will all other characters, so the
whole "UTF-8 is a subset of the standard charset" assumption is
out the window. Compile in ASCII mode and those problems go away,
so *if* you decide to support those systems, that is probably the
only feasible way.

Frankly, I doubt if Subversion compiles on AS/400 or any other
EBCDIC platform, so this is likely a non-issue.

> This is getting really messy, but since we have to replace stdio printf
> and friends to support other things, I think we will have to solve this
> problem.

At some point you may want to define a minimum platform to
support which includes being based on ASCII and not EBCDIC.

-- 
______________________________________________________________________
Michael Sweet, Easy Software Products           mike at easysw dot com
Printing Software for UNIX                       http://www.easysw.com
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Sat May 8 00:22:53 2004

This is an archived mail posted to the Subversion Dev mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.