[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

RE: Trival merge of big text file: Dismal performance, 540x faster if binary.

From: Tony Sweeney <tsweeney_at_omnifone.com>
Date: Thu, 13 Jan 2011 15:58:39 -0000

Why bother with a script? Just wget a few high traffic websites (slashdot, yahoo, dailykos, google news) or similar into a file every now and again.

Tony.

> -----Original Message-----
> From: Johan Corveleyn [mailto:jcorvel_at_gmail.com]
> Sent: 13 January 2011 14:26
> To: krueger, Andreas (Andreas Krüger, DV-RATIO);
> users_at_subversion.apache.org
> Subject: Re: Trival merge of big text file: Dismal
> performance, 540x faster if binary.
>
> On Thu, Jan 13, 2011 at 2:07 PM, Stefan Sperling
> <stsp_at_elego.de> wrote:
> > On Thu, Jan 13, 2011 at 01:55:58PM +0100, Johan Corveleyn wrote:
> >> Textual merging in svn makes use of a variant of the standard diff
> >> algorithm, namely diff3. Just a couple of days ago, I finally
> >> succeeded in making diff3 take advantage of those performance
> >> improvements (haven't committed this to the branch yet, but maybe
> >> I'll get to it tonight).
> >>
> >> Would you be able to build an svn client from source? If so, could
> >> you perhaps build a client from
> >>
> http://svn.apache.org/repos/asf/subversion/branches/diff-optimization
> >> s-bytes
> >> ?
> >
> > Hey Johan,
> >
> > I would be interested in doing testing and reviewing the changes on
> > your branch. There might still be enough time to get them into 1.7.
>
> Thanks, that would be great (btw, danielsh also expressed an
> interest in reviewing the branch). I will try to give an
> status update on the dev-list after I've committed the
> changes for diff3.
>
> > I don't have any suitably large XML files though.
> > If you and/or Andreas could provide some that would be great.
>
> I was thinking of writing a python script (as philip already
> suggested) that can generate several variants of large files
> with semi-random data. I have some prototype code for this
> lying around, so if I find the time, I'll try to wrap this up
> and send it to the dev list. OTOH, real-world examples are
> probably even better.
>
> Cheers,
> --
> Johan
>
> ______________________________________________________________________
> This email has been scanned by the MessageLabs Email Security System.
> For more information please visit
> http://www.messagelabs.com/email
> ______________________________________________________________________
>
Received on 2011-01-13 16:59:21 CET

This is an archived mail posted to the Subversion Users mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.