[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: Trival merge of big text file: Dismal performance, 540x faster if binary.

From: Johan Corveleyn <jcorvel_at_gmail.com>
Date: Thu, 13 Jan 2011 15:25:35 +0100

On Thu, Jan 13, 2011 at 2:07 PM, Stefan Sperling <stsp_at_elego.de> wrote:
> On Thu, Jan 13, 2011 at 01:55:58PM +0100, Johan Corveleyn wrote:
>> Textual merging in svn makes use of a variant of the standard diff
>> algorithm, namely diff3. Just a couple of days ago, I finally
>> succeeded in making diff3 take advantage of those performance
>> improvements (haven't committed this to the branch yet, but maybe I'll
>> get to it tonight).
>>
>> Would you be able to build an svn client from source? If so, could you
>> perhaps build a client from
>> http://svn.apache.org/repos/asf/subversion/branches/diff-optimizations-bytes
>> ?
>
> Hey Johan,
>
> I would be interested in doing testing and reviewing the changes
> on your branch. There might still be enough time to get them into 1.7.

Thanks, that would be great (btw, danielsh also expressed an interest
in reviewing the branch). I will try to give an status update on the
dev-list after I've committed the changes for diff3.

> I don't have any suitably large XML files though.
> If you and/or Andreas could provide some that would be great.

I was thinking of writing a python script (as philip already
suggested) that can generate several variants of large files with
semi-random data. I have some prototype code for this lying around, so
if I find the time, I'll try to wrap this up and send it to the dev
list. OTOH, real-world examples are probably even better.

Cheers,

-- 
Johan
Received on 2011-01-13 15:26:33 CET

This is an archived mail posted to the Subversion Users mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.