Binary differencing performs poorly (erractically) on very large text file (REPOST)
From: Charles Butterfield <charles.butterfield_at_nextcentury.com>
Date: Fri, 15 Feb 2008 18:49:21 -0500
I have a large text file (around 47 MB) which is a database dump
Doing a "diff" on the client side files always generates a "relatively"
Observations:
1) Something about the binary differencing is working erratically,
only for this large file, in the sense of not always noticing that
differences between two revs are much smaller than the entire file.
2) Compression, on the other hand, seems fine.
Some numbers:
Repo Client Client Diff
Rev (KB) File (KB) Diff (KB)
-------- --------- ---------
11,532 53,942 n/a
11,532 53,943 166
335 53,961 656
11,594 54,023 244
89 54,023 1
Reproducible on a variety of clients and servers, most recent of which
Questions:
1) Is this a known problem?
2) Is there some easy way I could force the client and/or server to spew
debug information that would be of assistance to some guru?
3) Should I open a problem report, or just send this to the "dev" list?
Thanks
-- Charlie Butterfield
|
This is an archived mail posted to the Subversion Users mailing list.
This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.