[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

RE: Bug report: UTF-8 encoded files merge issue

From: Gary Hai <haiguiqing_at_gmail.com>
Date: 2005-12-30 02:35:08 CET

I have tried to change the font settings, but nothing happened.
From the about dialog, i got this version info: tortoiseSVN 1.2.6, Build
4786 - 32 Bit .

As Simon wrote:
>>It *does* support UTF-8, but it requires a proper BOM at start of file,
>> it cannot be sure what the encoding is. I think Stefan has just improved
>>auto-recognition so it will find UTF-8 with no BOM more easily now.

>>Stefan, how hard would it be to add a menu to allow the user to specify
>>encoding to use? Default would be 'Auto' as it is now, but the user could
>>override that if needed.

According to the specification of UTF-8, it seems very hard to
auto-recognition UTF-8 encode. Anyway, if TortoiseMerge is XML-aware design,
it can get encode info from XML head (default as UTF-8).
It is very good suggestion to specify the encodeing to use becaseu there is
already a drop down menu to selection language.

P.S. I have written a email to google ask for XML support in their Google
Desktop product, and they have added the feature very fast. All we know the
XML is more and more popular.



-----Original Message-----
From: Kalin KOZHUHAROV [mailto:kalin@thinrope.net]
Sent: Thursday, December 29, 2005 7:54 PM
To: dev@tortoisesvn.tigris.org
Cc: haiguiqing@gmail.com
Subject: Re: Bug report: UTF-8 encoded files merge issue

Gary Hai wrote:
> I am using MSN Messenger in two computers. And I try to use
> TortoiseSVN to synchronize the message history files.
OK, I needed to read your MSG 4 times to get what is this MSN doing here :-)

> While Chinese is my native language, the message history stored in XML
file by UTF-8 encoding.
What about your OS? Is it Chinese?

> When I run TortoiseMerge to diff and merge the two sources files, the
> Chinese characters cannot be displayed in correct encoding, and the
> merge can result in files corrupt. Is this a BUG for encoding?
By default TortoiseMerge (still) does not support UTF-8 or any other
encoding. The only supported encoding is the default of your OS (depends on
the OS language).

If you are using Chinese windoze, then in the preferences for the fonts used
in TortoiseMerge, select a proper font and that may fix the problem.

What is the verion of TSVN that you use? Is it the latest?


P.S. A statistic: I answer this/similar question at least once every week...

|[ ~~~~~~~~~~~~~~~~~~~~~~ ]|
+-> http://ThinRope.net/ <-+
|[ ______________________ ]|
To unsubscribe, e-mail: dev-unsubscribe@tortoisesvn.tigris.org
For additional commands, e-mail: dev-help@tortoisesvn.tigris.org
Received on Fri Dec 30 02:35:24 2005

This is an archived mail posted to the TortoiseSVN Dev mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.