[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Performance (Subversion vs. CVS)

From: Greg Ward <gward_at_python.net>
Date: 2004-09-14 04:37:54 CEST

I'm looking into Subversion to use at work -- we have a pretty large
CVS repository (5 GB) that feels like its pushing the limits of CVS.
Probably the most obvious reason to change would be for performance --
constant-time branching and tagging are obviously attractive, as are
no-network-required diff and revert.

But I've been playing around with timing "cvs co" and "cvs up" versus
their Subversion counterparts, and the results are not good. In a
nutshell, CVS is 5-15 times faster at checking out and updating working
trees.

Platform: Debian Linux "unstable"
          cvs 1.12.9 (debian 1.12.9-4)
          subversion 1.0.6 (debian 1.0.6-2)

I converted a substantial chunk of our CVS repository (one top-level
directory which accounts for ~600 MB of the ~5 GB total) using cvs2svn.
Took most of a night and day to run, and resulted in a Subversion
repository with:
  ~1100 files (trunk)
  ~9900 revisions
     23 branches
   2407 tags (our CVS workflow is rather tag heavy)

Both the CVS and Subversion repositories are on a pretty fast (ATA100)
IDE disk, and I'm checking out to a pretty fast (Ultra160) SCSI disk.
(I didn't want to fill up my system's primary disk [the SCSI one] with
enormous repositories.) I used local filesystem access everywhere,
eg. "cvs -d /path/to/repos" and "svn co file:///path/to/repos/...".

Here are some timing results from CVS; all times are in seconds, each
measurement repeated 5 times (too lazy to record two decimal places on
the first test, got keen after that):

  initial co 4.1 3.6 3.9 5.4 5.0
  up (no-op) 1.00 0.92 0.87 0.86 1.46
  up -r BETA (from -A) 4.24 3.36 4.21 2.99 4.14
  up -A (from -r BETA) 1.96 2.61 2.12 1.80 2.53
  co -r <recent-branch> 6.83 5.85 5.05 6.31 5.48

The initial checkout was of the same top-level directory that I
converted to a Subversion repository. BETA is a tag that we use for the
"trailing edge" of the trunk, stuff deemed fit for internal builds and
consumption. Switching directories from BETA to the tip of the trunk is
a fairly common occurrence. Checking out recent branches also happens
roughly as often as checking out the trunk. (Not as often as
update/diff/commit, but often enough that I thought it would be worth
timing.)

Now here are the same activities with Subversion 1.0.6:

  co .../trunk 49.3 47.5 46.8
  up (no-op) 1.39 0.54 0.41 2.53 1.17
  switch .../tags/BETA 17.8 19.3 18.0
  switch .../trunk 34.3 35.6 35.7

Wow: the only operation where CVS doesn't completely blow Subversion out
of the water is "update" on a fresh checkout. The fresh checkout itself
is 10x slower with Subversion, and switching back and forth between
tip-of-trunk and BETA is about 5x slower. :-(

This is very disappointing; I was hoping that using an opaque binary
database would at least have slight performance benefits, not an order
of magnitude penalty. Should I at least try out 1.1? Any suggestions
for improving performance of Subversion, short of getting in there with
a profiler myself?

        Greg

-- 
Greg Ward <gward_at_python.net>                         http://www.gerg.ca/
Know thyself.  If you need help, call the CIA.
---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@subversion.tigris.org
For additional commands, e-mail: users-help@subversion.tigris.org
Received on Tue Sep 14 05:41:01 2004

This is an archived mail posted to the Subversion Users mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.