[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Some FSFS repo stats

From: Stefan Fuhrmann <stefan.fuhrmann_at_wandisco.com>
Date: Sun, 18 Nov 2012 23:52:44 +0100

Hi there,

For those of you that are facts and numbers crazy,
I attached some data size info for 3 large FSFS
repositories. They are 1.8-format mirrors of the
Apache, KDE and wordpress repositories. I used
my new fsfs-stats tool to extract the info.

Some of my findings:

* Apache: lots of large zip files added lately
  (low overall compression rate but tool does not
   list zip files etc. as the reason - yet)
* KDE: still larger then Apache with an excellent
  compression ratio (lots of large .po files); >1TB
* Wordpress: directory compression eliminated
  directory storage overhead (5000% => <10%)

* rep sharing is most effective when you have many
  "casual" users (> factor 2 in wordpress; 25% savings
  for Apache; insignificant for KDE since po files
  are not shared / identical between branches)
* noderevs + changes list takes up 10..30% of
  the total repo size, i.e. actual content already
  well compressed

* more different file props reps than I thought
  (probably due to per-file old merge info)
* >50% of all nodes in Apache repo have props
* rep sharing + deltification brings prop info down
  to ~10 bytes / rev for Apache

-- Stefan^2.

-- 
Certified & Supported Apache Subversion Downloads:
*
http://www.wandisco.com/subversion/download
*



Received on 2012-11-18 23:53:17 CET

This is an archived mail posted to the Subversion Dev mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.