RE: UTF-8 (was: Re: property names)
From: Bill Tutt <billtut_at_microsoft.com>
Date: 2000-12-22 01:12:33 CET
Not that this is a "how best to localize software" mailing list, but...
UTF-8 is only compact if you're a Western European type: ASCII takes one byte per character, but most CJK characters take three.
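To put rough numbers on that, here is a minimal Python sketch that counts the encoded bytes for a few sample strings. The helper name `encoded_sizes` and the sample strings are made up for illustration and are not from any Subversion code. UTF-16 is measured without a byte order mark, and the last sample is a single CJK character outside the Basic Multilingual Plane, which UTF-16 stores as a surrogate pair.

```python
def encoded_sizes(text):
    """Return the number of bytes `text` occupies in UTF-8 and UTF-16.

    "utf-16-le" is used so that no byte order mark is counted.
    """
    return {
        "utf-8": len(text.encode("utf-8")),
        "utf-16": len(text.encode("utf-16-le")),
    }


# Illustrative samples (assumptions, chosen only to show the trend).
samples = {
    "ascii": "property names",          # 14 ASCII characters
    "cjk (BMP)": "日本語のテキスト",      # 8 characters inside the BMP
    "cjk (non-BMP)": "\U00020000",      # 1 character from CJK Extension B
}

for label, text in samples.items():
    sizes = encoded_sizes(text)
    print(f"{label}: utf-8={sizes['utf-8']} bytes, utf-16={sizes['utf-16']} bytes")
```

For ASCII text UTF-8 is half the size of UTF-16; for BMP CJK text UTF-16 is the smaller of the two; for characters outside the BMP both need four bytes.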
CJK will actually need UTF-16 character pairs in order to cover all of
If you think that still takes up too much room, go take a look at:
This describes an encoding stream that gives UTF-8 like (storage
UTF-16 strikes a nice balance being easy to deal with (95% fixed width),
This is an archived mail posted to the Subversion Dev mailing list.