[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

[TSVN] Re: arabic support

From: Simon Large <slarge_at_blazepoint.co.uk>
Date: 2004-11-30 13:06:02 CET

SteveKing wrote:
> On Tue, 30 Nov 2004 04:53:10 +0300, gar <gar@arabink.com> wrote:
>> Maybe my files are missing the BOM? What's the codepoint, I'll try
>> to insert it. Still, various tools I use are able to display the
>> Arabic ok without a BOM (I think).
>
> The problem with files without BOM's is that you can't tell for sure
> if they're UTF8 or not. There are some guessing functions available,
> but they all fail under certain circumstances. That's why I haven't
> implemented such a guessing function in TortoiseMerge and only rely on
> the BOM's (the only thing that's 100% sure).

In TortoiseMerge the TSVN german doc files are displayed as garbage when
the BOM is missing, so that is very likely your problem.

The BOM consists of 3 bytes { 0xef, 0xbb, 0xbf } if you want to add them
manually.

I wrote a simple command line program to check for and add BOMs and XML
tags (encoding=UTF-8). Originally this was just to make it easier to
maintain the TSVN docs which lose their BOMs from time to time. If it is
of any interest I can tidy it up and add it to the source tree (where?)
or just zip it and email to individuals.

Simon

-- 
       ___
  oo  // \\      "De Chelonian Mobile"
 (_,\/ \_/ \     TortoiseSVN
   \ \_/_\_/>    The coolest Interface to (Sub)Version Control
   /_/   \_\     http://tortoisesvn.tigris.org
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@tortoisesvn.tigris.org
For additional commands, e-mail: dev-help@tortoisesvn.tigris.org
Received on Tue Nov 30 13:06:38 2004

This is an archived mail posted to the TortoiseSVN Dev mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.