Re: Issue tracker cleanup: SVN-807

From: Branko Čibej <brane_at_apache.org>
Date: Tue, 12 Nov 2019 16:33:40 +0100

On 12.11.2019 16:13, Nathan Hartman wrote:
> I'm guessing
> there was a desire to do more than print the offending hex codes but
> I don't know what else you could do.

If it's invalid UTF-8, there's nothing you can do.
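
To illustrate the "print the offending hex codes" fallback, here is a minimal Python sketch. It is not Subversion's code (Subversion is written in C), and the function name escape_invalid_utf8 is made up for this example. It relies only on Python's standard "backslashreplace" error handler.

```python
def escape_invalid_utf8(data: bytes) -> str:
    """Decode bytes as UTF-8, showing undecodable bytes as \\xNN escapes.

    Hypothetical helper for illustration only; valid input passes
    through unchanged.
    """
    return data.decode("utf-8", errors="backslashreplace")


if __name__ == "__main__":
    # 0xE9 is "é" in Latin-1 but an incomplete sequence in UTF-8.
    print(escape_invalid_utf8(b"caf\xe9"))
```

The original bytes cannot be interpreted any further, so all the sketch can do is make them visible.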

There's a lot you can do if the UTF-8 is valid, but the target locale
can't represent the characters. Transliteration is a thing, but if we go
down that path, it's hard to decide when to stop. Subversion is not a
linguistics tool, it's a version control tool.

-- Brane

P.S.: I think there's even code that removes diacritical marks from the
source Unicode to make conversion to the target locale easier, at least
for Latin-based locales. That's already a kind of transliteration, and
even that may be way too much.
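
A rough Python sketch of that kind of diacritic stripping, for the case where the UTF-8 is valid but the target encoding can't represent it. This is an assumption about the general technique, not the actual Subversion code; the names strip_diacritics and to_target are invented for the example, and "ascii" merely stands in for a limited target locale.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Remove combining marks after canonical decomposition (NFD)."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def to_target(text: str, encoding: str = "ascii") -> str:
    """Return text as it would appear after conversion to `encoding`.

    Hypothetical helper: try a lossless conversion first, then fall
    back to stripping diacritics, and escape whatever still can't be
    represented.
    """
    try:
        return text.encode(encoding).decode(encoding)
    except UnicodeEncodeError:
        stripped = strip_diacritics(text)
        return stripped.encode(encoding, errors="backslashreplace").decode(encoding)


if __name__ == "__main__":
    print(to_target("Čibej"))
    print(to_target("日本"))
```

This only helps for characters that decompose into a base letter plus combining marks. Anything else still ends up as an escape, which is where the question of when to stop comes in.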
Received on 2019-11-12 16:33:45 CET

This is an archived mail posted to the Subversion Dev mailing list.