[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: Non Ascii chars in paths cause trouble

From: Lübbe Onken <l.onken_at_rac.de>
Date: 2004-09-22 10:46:35 CEST

Erik Huelsmann wrote:

> What locale does your vi terminal run in? does it use iso-8859-1 / -15
> character encoding? (assuming you use a german locale)
locale tells me it's de_DE@euro for my svn user and POSIX for root. I
tried setting it to de_DE.utf8 before dumping the repository, but that
didn't make any difference.

> If it does, then the fact that the tags directory name looks like it does
> (with the accented characters) is quite alarming. They should have been
> recoded to utf-8, which looks 'scrambled' on a iso-8859-xx encoded terminal.
:-|

> You can dump your repositories, find the paths which are not correctly utf-8
> encoded, replace the invalid characters with valid utf-8 characters and try
> loading the dumpfiles. That's all I can advise you to do. It's not much, a
> lot of work, but it's possible.
Ouch, I was afraid of that answer. So the only thing left to me is
dumping, grepping "Node-path"s for accented characters and replacing them?
But what about accented characters inside my source? Should they be
UTF-8 in the dumpfile as well or are the file contents not touched by svn?

I still have the problem on the target system, that the german
characters in my commit messages are not displayed properly when
checking with svnlook. I get two possible (but no good) results
depending on my locale setting.

svnlook on the server:
======================
svn@raneu:~> locale
LANG=de_DE@euro
...
svn@raneu:~> svnlook log -r41 repository/testrepos/
Doppelte Datei auf dem Server gel?\246scht

svn@raneu:~> export LANG=de_DE.utf-8
svn@raneu:~> svnlook log -r41 repository/testrepos/
Doppelte Datei auf dem Server gelöscht

Browsing the repository on the server using WebSVN also results in the
?\246, since WebSVN is using svnlook to fetch the status.

svn log on the server:
======================
using 'svn log' on the server results in a proper log message:
svn_at_raneu:~> svn log file:///svn/repository/testrepos/
------------------------------------------------------------------------
r41 | lonken | 2004-09-22 10:15:29 +0200 (Mit, 22 Sep 2004) | 1 line

Doppelte Datei auf dem Server gelöscht

svn log on windows client:
=========================
D:\Testprojekt\RA_Neu>svn log
------------------------------------------------------------------------
r41 | lonken | 2004-09-22 10:15:29 +0200 (Mi, 22 Sep 2004) | 1 line

Doppelte Datei auf dem Server gel?\195?\182scht

TortoiseSVN on Windows client:
=============================
Revision: 41
Autor: lonken
Datum: 22.09.2004 10:15:29
Meldung:
Doppelte Datei auf dem Server gelöscht

----
Löschen  /tags/Test ob geht/Lib/lizenz.pas
That makes me hope that the log messages are at least dumped&loaded OK.
But:
- Why does 'svn log' on windows display ?\195?\182 instead of 'ö'?
- Why does 'svn log' on Linux display the log message properly?
- Why does 'svnlook log' on Linux *not* display the log message 
properly? (same bash, same locale...) Is this a bug in svnlook?
Cheers
- Lübbe
--
        ___
   oo  // \\      "De Chelonian Mobile"
  (_,\/ \_/ \     TortoiseSVN
    \ \_/_\_/>    The coolest Interface to (Sub)Version Control
    /_/   \_\     http://tortoisesvn.tigris.org
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Wed Sep 22 10:47:04 2004

This is an archived mail posted to the Subversion Dev mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.