[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: Lockups on large initial import

From: Matt Pounsett <matt.pounsett_at_cira.ca>
Date: 2004-09-22 14:01:01 CEST

Another self-followup:

On Sep 10, 2004, at 21:00, Matt Pounsett wrote:

> I re-ran my tests today using svnserve instead of Apache, and had no
> problem. I moved back to Apache and got this during the Transferring
> Data phase:
> svn: Commit failed (details follow):
> svn: At least one property change failed; repository is unchanged
>
> ... and the db is now locked up.
>
> I ran your script over an http connection and got a failure as well.
>
> Are any of the developers watching this thread? This seems pretty
> consistently repeatable.. is there any further troubleshooting you'd
> like us to do to help track down the problem?

More info on this particular issue.

I managed to catch another "At least one property change failed" error
this morning, and here's what I found. It appears to me as if the
problem has something to do with mod_dav_svn failing to remove db locks
as Apache expires its children and HTTP requests get passed off to new
children.

After getting the error, I checked on my httpd processes, and found a
few of them waiting for futex calls to return. Now here's where it
gets interesting... (note for completeness that I'm the only one
accessing this httpd at the moment, and I'm only running one svn client
at a time):

% lsof __db*
COMMAND PID USER FD TYPE DEVICE SIZE NODE NAME
httpd 12883 apache mem REG 9,0 16384 3597498 __db.005
httpd 12883 apache mem REG 9,0 917504 3597497 __db.004
httpd 12883 apache mem REG 9,0 327680 3597496 __db.003
httpd 12883 apache mem REG 9,0 278528 3597495 __db.002
httpd 12883 apache mem REG 9,0 16384 3597494 __db.001
httpd 12886 apache mem REG 9,0 16384 3597498 __db.005
httpd 12886 apache mem REG 9,0 917504 3597497 __db.004
httpd 12886 apache mem REG 9,0 327680 3597496 __db.003
httpd 12886 apache mem REG 9,0 278528 3597495 __db.002
httpd 12886 apache mem REG 9,0 16384 3597494 __db.001
httpd 12889 apache mem REG 9,0 16384 3597498 __db.005
httpd 12889 apache mem REG 9,0 917504 3597497 __db.004
httpd 12889 apache mem REG 9,0 327680 3597496 __db.003
httpd 12889 apache mem REG 9,0 278528 3597495 __db.002
httpd 12889 apache mem REG 9,0 16384 3597494 __db.001

% strace -p 12883 -p 12886 -p 12889
Process 12883 attached - interrupt to quit
Process 12886 attached - interrupt to quit
Process 12889 attached - interrupt to quit
[pid 12883] futex(0xb6eca2b0, FUTEX_WAIT, 2, NULL <unfinished ...>
[pid 12886] futex(0xb6f0d5c8, FUTEX_WAIT, 2, NULL <unfinished ...>
[pid 12889] futex(0xb6f0d5c8, FUTEX_WAIT, 2, NULL <unfinished ...>
Process 12883 detached
Process 12886 detached
Process 12889 detached

So the only httpd processes accessing the database are all waiting for
some other process to release a lock.

Matt Pounsett Canadian Internet Registration Authority
Technical Support Programmer 350 Sparks Street, Suite 1110
matt.pounsett@cira.ca Ottawa, Ontario, Canada
613.237.5335 ext. 231 http://www.cira.ca

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@subversion.tigris.org
For additional commands, e-mail: users-help@subversion.tigris.org
Received on Wed Sep 22 14:01:40 2004

This is an archived mail posted to the Subversion Users mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.