
Re: SVN Book Method for Splitting Repos doesn't work

From: Patrick Smears <patrick.smears_at_ensoft.co.uk>
Date: 2004-12-30 23:24:28 CET

On Thu, 30 Dec 2004, Max Bowsher wrote:

> [...]
> Any small overhead can become quite magnified when dealing with gigabytes of
> data, and if you want to restrict the available byte values to printable
> ASCII, then the amount of space required to store arbitrary data will
> increase by approximately a factor of 3.

In general I agree 100% with what you're saying, but I'm puzzled as to
where the factor of 3 comes from. With some sort of base64 encoding you
can store 6 bits of "real" data in every 8 bits of "encoded" data, i.e.
about 33% extra, give or take some overhead for padding, sensible line
breaks etc. So I'd have expected a figure of 30-40% extra to be more
likely. Playing with uuencode would seem to confirm this:

% head -c1000000 /dev/urandom | uuencode - | wc -c
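
For anyone without uuencode to hand, roughly the same check can be sketched in Python. This is only an illustration under a few assumptions: the helper name `uu_body_size` is made up here, it counts just the encoded body lines (not the begin/end lines the uuencode tool adds), and base64 is measured with the standard library's default line wrapping.

```python
import base64
import binascii
import os


def uu_body_size(data: bytes) -> int:
    """Total size of the uuencoded body lines (hypothetical helper).

    binascii.b2a_uu accepts at most 45 input bytes per call and returns
    one encoded line including its trailing newline.
    """
    return sum(
        len(binascii.b2a_uu(data[i:i + 45]))
        for i in range(0, len(data), 45)
    )


# 1,000,000 random bytes, the same amount as in the shell command above.
data = os.urandom(1_000_000)

b64_size = len(base64.encodebytes(data))  # base64 with a newline every 76 chars
uu_size = uu_body_size(data)

print(f"base64:   {b64_size} bytes ({b64_size / len(data):.3f}x)")
print(f"uuencode: {uu_size} bytes ({uu_size / len(data):.3f}x)")
```

Both ratios should land in the 30-40% overhead range, since each scheme packs 6 bits of data into every 8-bit character and then adds a little for line structure.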

Indeed, I'd have thought that storing each "real" byte as two hex digits
would only double the output size... What am I missing here?
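
The hex case, and the general relationship between alphabet size and expansion, can be sketched the same way. The function name `expansion` is an assumption for illustration. It gives the ideal ratio only, ignoring padding and line breaks, and it assumes every output character occupies one 8-bit byte.

```python
import binascii
import math
import os


def expansion(alphabet_size: int) -> float:
    """Ideal output/input size ratio for an encoding whose output
    characters are drawn from `alphabet_size` distinct byte values.

    Each 8-bit output character carries log2(alphabet_size) bits of data.
    """
    return 8 / math.log2(alphabet_size)


data = os.urandom(1_000_000)
hex_size = len(binascii.hexlify(data))  # two hex digits per input byte

print(hex_size / len(data))          # 2.0
print(f"{expansion(16):.3f}")        # hex: 16 symbols
print(f"{expansion(64):.3f}")        # base64/uuencode: 64 symbols
print(f"{expansion(95):.3f}")        # all 95 printable ASCII characters
```

On these assumptions, even plain hex stays at a factor of 2, and any scheme using a larger printable alphabet does better than that.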


Received on Thu Dec 30 23:27:26 2004

This is an archived mail posted to the Subversion Users mailing list.
