Re: [PATCH] First cut at 1954 solution

From: <kfogel_at_collab.net>
Date: 2004-12-08 08:10:58 CET

Greg Stein <gstein@lyra.org> writes:
> > A series of 1 to 4 bytes that represents a Unicode code point,
> > using the encoding described by the UTF-8 specification.
>
> UTF-8 is an encoding rather than a character set. Thus, it does not
> define any characters. *Unicode* is a character set, and UTF-8 is a
> particularing encoding of that charset. Therefore, Brane is right:
> UTF-8 does not define any characters -- Unicode does.
>
> Branko's suggested text is "closer to reality". There is an error in
> the encoding, rather than an erroneous character.
>
> All that said: given that UTF-8 only applies to the Unicode charset,
> there is a very fine line there. But to be pedantic... :-)

Heh heh! Nice. Okay, I guess if we're really going to pick nits,
then Brane is right :-).

Ahem.

I hereby define a UTF-8 character to be [see above-quoted definition].

From now on, I will use the phrase "UTF-8 character" with this
meaning. I may even use that language in a Subversion doc string
someday. But I will not use it in an error message without bicameral
approval from the Slovenian parliament.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Wed Dec 8 08:16:49 2004

This message: [ Message body ]
Next message: Lübbe Onken: "Re: Localization of user replies"
Previous message: Greg Stein: "Re: [PATCH] First cut at 1954 solution"
In reply to: Greg Stein: "Re: [PATCH] First cut at 1954 solution"
Next in thread: Branko Čibej: "Re: [PATCH] First cut at 1954 solution"
Reply: Branko Čibej: "Re: [PATCH] First cut at 1954 solution"

Contemporary messages sorted: [ By Date ] [ By Thread ] [ By Subject ] [ By Author ] [ By messages with attachments ]