
Re: improving subversion treatment of compressed XML/text file formats

From: David Kaplan <David.Kaplan_at_ird.fr>
Date: Fri, 24 Oct 2008 16:21:54 +0200


On Wed, 2008-10-22 at 17:45 -0500, Ryan Schmidt wrote:

> Subversion stores all files in the repository as differences against
> previous versions. It does not differentiate between text or binary
> files at this point. However, depending on the compression algorithm,
> compressed files don't necessarily lend themselves to efficient
> diffing, which can result in them taking more space in the repository
> over time than the uncompressed versions would have.

I wasn't sure about this point, but in my experience small changes to
a document seem to produce large diffs in the compressed version,
leading to a large repository.
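
To put a rough number on that impression, here is a minimal Python
sketch. It is only an illustration under stated assumptions: the
generated text is a hypothetical stand-in for a real document, zlib
stands in for whatever compression the file format uses, and
shared_fraction is a crude made-up measure (bytes that still line up
from either end), not Subversion's actual delta algorithm.

```python
import zlib


def shared_fraction(a: bytes, b: bytes) -> float:
    """Fraction of bytes that agree when a and b are aligned at both ends."""
    limit = min(len(a), len(b))
    prefix = 0
    while prefix < limit and a[prefix] == b[prefix]:
        prefix += 1
    suffix = 0
    # Stop before the suffix overlaps the prefix already counted.
    while suffix < limit - prefix and a[-1 - suffix] == b[-1 - suffix]:
        suffix += 1
    return (prefix + suffix) / max(len(a), len(b))


# Hypothetical stand-in for a document's text content.
original = "\n".join(
    f"line {i}: value {i * 7919 % 10007}" for i in range(2000)
).encode()
# One small edit near the start of the document.
edited = original.replace(b"line 10:", b"line ten:", 1)

packed_original = zlib.compress(original)
packed_edited = zlib.compress(edited)

raw_shared = shared_fraction(original, edited)
packed_shared = shared_fraction(packed_original, packed_edited)

print(f"uncompressed: {raw_shared:.1%} of bytes unchanged")
print(f"compressed:   {packed_shared:.1%} of bytes unchanged")
```

The uncompressed figure should come out close to 100%, and the
compressed one far lower, because the bytes after the edit are encoded
differently and the trailing zlib checksum changes as well. A real
delta algorithm can also match data that has moved, so treat this only
as a rough indication.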

> Note that an OpenOffice.org file is not a compressed text file, but a
> compressed directory of several text files.
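
As a small illustration of that structure, the Python sketch below
builds two toy zip packages in memory and compares them member by
member after unpacking. The member names follow the usual OpenDocument
layout, but the contents are placeholders, and build_package, unpack,
and changed_members are hypothetical helpers written for this example;
nothing here is something Subversion does.

```python
import io
import zipfile


def build_package(members: dict) -> bytes:
    """Pack a name -> bytes mapping into an in-memory zip archive."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        for name, data in members.items():
            # OpenDocument keeps "mimetype" uncompressed; the rest is deflated.
            method = zipfile.ZIP_STORED if name == "mimetype" else zipfile.ZIP_DEFLATED
            archive.writestr(name, data, compress_type=method)
    return buffer.getvalue()


def unpack(package: bytes) -> dict:
    """Return a name -> uncompressed bytes mapping for a zip archive."""
    with zipfile.ZipFile(io.BytesIO(package)) as archive:
        return {name: archive.read(name) for name in archive.namelist()}


def changed_members(old_package: bytes, new_package: bytes) -> list:
    """Names whose uncompressed contents differ between two packages."""
    old, new = unpack(old_package), unpack(new_package)
    return sorted(
        name for name in old.keys() | new.keys() if old.get(name) != new.get(name)
    )


# Placeholder contents; a real document has far larger XML members.
old_members = {
    "mimetype": b"application/vnd.oasis.opendocument.text",
    "content.xml": b"<text>Hello world</text>",
    "styles.xml": b"<styles/>",
    "meta.xml": b"<meta/>",
}
new_members = {**old_members, "content.xml": b"<text>Hello, world</text>"}

old_package = build_package(old_members)
new_package = build_package(new_members)

print(changed_members(old_package, new_package))
```

Compared member by member, only content.xml is reported as changed,
which is the kind of view a plain-text diff could then work with.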

Yes, but this shouldn't be too difficult to handle, since at least the
standard Linux diff command diffs directories without difficulty (say
that three times fast). I believe that Subversion uses its own
algorithm, but handling directories can't be too hard. One option would
be to tar the directory into one file that would still be human


David M. Kaplan
Charge de Recherche 1
Institut de Recherche pour le Developpement
Centre de Recherche Halieutique Mediterraneenne et Tropicale
av. Jean Monnet
B.P. 171
34203 Sete cedex
Phone: +33 (0)4 99 57 32 27
Fax: +33 (0)4 99 57 32 95
Received on 2008-10-24 17:36:42 CEST

This is an archived mail posted to the Subversion Users mailing list.
