[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: converting unconvertible UTF-8 data

From: Marcus Comstedt <marcus_at_mc.pp.se>
Date: 2002-07-21 15:56:18 CEST

Karl Fogel <kfogel@newton.ch.collab.net> writes:

> 3) Have a fuzzy conversion function that tries to convert all the
> data, but if that fails, converts every character it can and
> replaces the others with ?\XXX (or some standard sequence) to
> indicate the Unicode value of the failed character.
>
>[...]
>
> Right now I like (3) the best, since it doesn't force the user to do
> something different. Of course, we'd have to choose wisely where we
> use the fuzzy function -- again, only "log" comes to mind so far.

iconv already does fuzzy conversion on some systems. For example, on
Solaris 8, I get ?:s instead of the "offending" character in Brankos
name when I do svn log in a ISO-8859-1 locale. Right now, apr_xlate
doesn't inform us that this has happened, since it doesn't check the
return code of iconv properly (unless this has been fixed recently).
But it _should_ tell us, so a choice where to allow it should be made
anyway.

  // Marcus

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Sun Jul 21 16:02:42 2002

This is an archived mail posted to the Subversion Dev mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.