[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: [Issue 2194] Unicde UTF-16 files detected as binary

From: Branko Čibej <brane_at_xbc.nu>
Date: 2005-01-06 02:41:28 CET

Peter N. Lundblad wrote:

>On Wed, 5 Jan 2005, Max Bowsher wrote:
>>Peter N. Lundblad wrote:
>>I agree with what you are saying, but what 2194 was saying was "UTF-16
>>should be detected as textual".
>Yes, it is more complicated than that, since it is an enconding where a
>line break is not one or two bytes, and for some other reasons. Still, I
>think we really need to support other Unicode encodings thatn UTF8, like
>we support other 8-bit encodings.
It is much more complicated than that. If we're to treat UTF-16 files as
text, we have to teach libsvn_diff to do diffs and merges correctly on
such files, and possibly enhance keyword expansion and newline
conversion, too.

In short, it's a whole can of worms that probably affects 90% of the
client-side code.

-- Brane

To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Thu Jan 6 02:42:40 2005

This is an archived mail posted to the Subversion Dev mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.