[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: Unicode UTF-16 files detected as binary

From: Max Bowsher <maxb_at_ukf.net>
Date: 2005-01-06 13:26:14 CET

Barry Scott wrote:
> Can I file an issue now Max? I guess the issue should be
>
> "Support Unicode fully, UTF-8, UTF-16 and UTF-32"

Don't we already support UTF-8 fully?
If not, I'd say that that is a bug, whilst UTF-{16,32} is a feature request.

> For what it is worth I have just implement diff for utf-16 file in my
> workbench GUI.
> Obviously does not help the svn repo diff utf-16/utf-32 files.
>
> What is the line ending problem in UTF-16/UTF-32?
> Is it that you have to look for LF and CR in 16bit and 32bit chars
> rather
> then bytes that would give false positives?

Exactly.

See this useful post by Branko Cibej:
http://svn.haxx.se/dev/archive-2005-01/0178.shtml

To summarize, supporting UTF-{16,32} involves supporting it in:
* Diff
* Merge
* Keyword expansion
* Newline conversion
* Text/binary discrimination
... any others not thought about here?

Please, go ahead and file an issue. Leave out any UTF-8 bugs you are aware
of, and start a new to discuss them. Mention all 5 points in the issue.

Max.

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@subversion.tigris.org
For additional commands, e-mail: users-help@subversion.tigris.org
Received on Thu Jan 6 13:32:00 2005

This is an archived mail posted to the Subversion Users mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.