
Re: Problem with UTF-8 files and creating and applying patches

From: Felix Saphir <felix.saphir_at_presswatch.de>
Date: Wed, 17 Feb 2010 14:26:33 +0100

Carsten Schuette wrote:
>
>> What about removing the BOM in the first place? You don't need a BOM for
>> UTF-8 and it can lead to various problems (like the one you're reporting).
>
> Well, I have about 7,000 files (all created with Visual Studio 2008),
> which all have the BOM. And whenever I open and save the file with
> Visual Studio, the BOM might be back. I am also not sure how the BOM
> will affect the encoding, maybe all my umlauts are lost?
>
> Why is the BOM a problem? I think it's a bug that TortoiseMerge does
> not handle the BOM correctly when applying a patch.

While you might be correct about TortoiseMerge and BOM, UTF-8 has a
defined byte-order, so there is no need for a BOM (see
<http://www.unicode.org/faq/utf_bom.html#bom5>).

I don't know anything about Visual Studio, but this might help you
remove the BOM:
<http://distilledb.com/blog/archives/date/2009/03/07/removing-unwanted-unicode-encodings.page>
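
If you would rather script it yourself, here is a minimal sketch in
Python. It is not taken from the page linked above. It assumes the
files really are UTF-8, and the function name strip_utf8_bom is made up
for this example. It only drops the three BOM bytes (EF BB BF) at the
very start of a file and leaves every other byte alone, so umlauts and
other non-ASCII characters stay exactly as they were.

```python
import codecs
from pathlib import Path


def strip_utf8_bom(path):
    """Remove a leading UTF-8 BOM from the file at `path`.

    Returns True if a BOM was found and removed, False otherwise.
    (Hypothetical helper, named for illustration only.)
    """
    p = Path(path)
    data = p.read_bytes()
    # codecs.BOM_UTF8 is the byte sequence b'\xef\xbb\xbf'
    if not data.startswith(codecs.BOM_UTF8):
        return False
    # Write back everything after the BOM, byte for byte
    p.write_bytes(data[len(codecs.BOM_UTF8):])
    return True
```

You could call this for every source file in the working copy, for
example via Path(".").rglob("*.cs") (the pattern is only a guess at
what your files look like). Try it on a copy first, since it rewrites
files in place.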

Felix

-- 
  /^\ | ASCII Ribbon Campaign
  \ / | - no HTML in email and news
   x  | http://www.asciiribbon.org/
  / \ | http://www.gerstbach.at/2004/ascii
------------------------------------------------------
http://tortoisesvn.tigris.org/ds/viewMessage.do?dsForumId=4061&dsMessageId=2448362
Received on 2010-02-17 14:26:43 CET

This is an archived mail posted to the TortoiseSVN Users mailing list.
