Re: SVN Book Method for Splitting Repos doesn't work

From: Patrick Smears <patrick.smears_at_ensoft.co.uk>
Date: 2004-12-30 23:24:28 CET

On Thu, 30 Dec 2004, Max Bowsher wrote:

> [...]
> Any small overhead can become quite magnified when dealing with gigabytes of
> data, and if you want to restrict the available byte values to printable
> ASCII, then the amount of space required to store arbitrary data will
> increase by approximately a factor of 3.

In general I agree 100% with what you're saying - but I'm puzzled as to
where the factor of 3 comes in? I'd have thought that, with some sort of
base64 encoding, you'd be able to store 6 bits of "real" data for every 8
bits of "encoded" data - give or take some overhead for padding, sensible
line breaks etc - so I'd have thought a figure of 30-40% extra would seem
more likely... playing with uuencode would seem to confirm this:

% head -c1000000 /dev/urandom | uuencode - | wc -c
1377800

Indeed, I'd have thought that storing each "real" byte as two hex digits
would only double the output size... What am I missing here?

Patrick

-- 
The easy way to type accents in Windows: http://www.frkeys.com/
---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@subversion.tigris.org
For additional commands, e-mail: users-help@subversion.tigris.org

Received on Thu Dec 30 23:27:26 2004

This message: [ Message body ]
Next message: Christopher Ness: "Re: workspace corruption issues"
Previous message: allan juul: "Re: svn copy - tags - avoid copying parent dir"
In reply to: Max Bowsher: "Re: SVN Book Method for Splitting Repos doesn't work"
Next in thread: Max Bowsher: "Re: SVN Book Method for Splitting Repos doesn't work"
Reply: Max Bowsher: "Re: SVN Book Method for Splitting Repos doesn't work"

Contemporary messages sorted: [ By Date ] [ By Thread ] [ By Subject ] [ By Author ] [ By messages with attachments ]