[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

File and folder names corrupted when importing from CVS using cvs2svn

From: Bo Berglund <bo.berglund_at_gmail.com>
Date: Thu, 18 Jan 2018 17:38:04 +0100

A few weeks ago I migrated our CVS repositories to SVN using the
latest version of cvs2svn. The CVS server was on a Windows 2003 server
and the new VisualSVN server on a Windows 2016 Server. I did the
cvs2svn conversion on a Ubuntu server to which I had copied the CVS
Then I imported the dump files on Windows 2016 via the VisualSVN GUI

It all looked like it was a success except for the handling of file
properties. Files marked binary in CVS are not marked as such in SVN
so I worry that they may become corrupted in the future. But it seems
like it does work OK.

Today I discovered a different issue that is more serious in nature:
In some projects there have been non-US characters used for file and
folder names like "Bygglovsansökan", "HÅL I TAKKASSETT" etc.

When I check out these projects from SVN the Swedish characters in the
names are now replaced by a series of high characters (hex view):

Å = C3 90 C2 9F
Ä = C3 90 C2 9E
Ö = -- did not find this character in svn yet --
å = C3 90 C2 96
ä = C3 90 C2 94
ö = C3 90 C2 A4

I don't know from where this problem originates, either it is a flaw
in the cvs2svn script, the configuration of the conversion or in the
format of the generated dump files.
Otherwise it may be a problem when importing the dump files into the
VisualSVN server....

What could I do to fix this?
(And please note that the new repository is in use so there are a
number of commits done since the migration...)

Bo Berglund
Developer in Sweden
Received on 2018-01-18 17:39:15 CET

This is an archived mail posted to the Subversion Users mailing list.