[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: Is FSFS compressing stored data?

From: Greg Hudson <ghudson_at_MIT.EDU>
Date: 2004-10-14 17:55:21 CEST

On Thu, 2004-10-14 at 10:34, Mark Phippard wrote:
> kfogel@newton.ch.collab.net wrote on 10/14/2004 08:37:08 AM:
> > Sure, the head revision is uncompressed. But the vast majority of
> > data in the repository is compressed, and therefore I think it's fair,
> > when speaking in general terms, to say that the repository is
> > compressed. (Unless the question was specifically about head, which I
> > may have missed.)

The historical data in the repository may not be "the vast majority of
the data" in some repositories. For instance, my work repository is
primarily an integration repository--imports of external source code
with a few local mods. Although we do import new versions of the
external source code periodically (thus creating history), the ratio of
head data to history is quite large.

> I believe Greg's point is that the presence of these full-texts,
> especially if you have a lot of branches, accounts for a lot of the
> difference in the repository sizes. Also, if I understand Greg, even the
> head revision in fsfs undergoes some degree of compression, such as
> stripping out whitespace etc...

Stripping out whitespace? No.

In FSFS, the first rev of a file is stored as a delta against empty,
which is a form of compression comparable to gzip (though not quite as
good). Later revs of the file are stored as deltas against earlier
revs--but not always the most immediate earlier rev. See
http://svn.collab.net/repos/svn/trunk/notes/skip-deltas for details.

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@subversion.tigris.org
For additional commands, e-mail: users-help@subversion.tigris.org
Received on Thu Oct 14 17:56:00 2004

This is an archived mail posted to the Subversion Users mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.