Re: Review of lock-many API

From: Philip Martin <philip.martin_at_wandisco.com>
Date: Fri, 28 Mar 2014 19:52:59 +0000

Julian Foad <julianfoad_at_btopenworld.com> writes:

> Philip Martin wrote:
>> Julian Foad writes:
>>>> URL: http://svn.apache.org/r1577280
>>>> * subversion/include/svn_fs.h
>>>>  (svn_fs_lock_target_t, svn_fs_lock_result_t,
>>>>   svn_fs_lock2, svn_fs_unlock2): new.
>>>>
>>>> * subversion/include/svn_repos.h
>>>>  (svn_repos_fs_lock2, svn_repos_fs_unlock2): new.
>>>
>>> Do we intend to deprecate the old versions? If not, then it would be
>>> better to name the new functions something like ...lock_many() instead
>>> of ...lock2().
>>
>> I'm unsure. In some ways it is convenient to have a single path
>> function as it avoids the need to construct hashes and the error
>> handling is simpler and less prone to leak. On the other hand when
>> svn_client_copy4 introduced multi-path copy source the single path
>> version was deprecated.
>
> Thoughts...
>
> A low level API such as libsvn_fs should aim to be clean and
> efficient, more so than to provide functions that are convenient for
> the casual user. (A high level API such as libsvn_client or the
> bindings is a more appropriate place to provide convenience
> functions.) We do have a number of APIs already, in several libraries,
> where the caller is expected to pass in a list of targets even if they
> only want to operate on one. Maybe in general the thing to aim for is
> to make it easy for a caller to do so.
>
> We could provide a singleton constructor for the
> hash-of-svn_fs_lock_target_t argument. Should we? Probably not at the
> libsvn_fs level; maybe at a high level.

I'll look at doing that.
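
For illustration only, a singleton constructor could be as small as the
sketch below. It uses hypothetical stand-in types (demo_lock_target_t,
demo_targets_t) rather than the real svn_fs_lock_target_t and apr_hash_t,
and models only the current_rev member that appears in the diff further
down; a real version would allocate the hash and target in a pool.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for svn_fs_lock_target_t; only current_rev
   is modelled here. */
typedef struct demo_lock_target_t {
  long current_rev;
} demo_lock_target_t;

/* Hypothetical stand-in for the hash of targets, reduced to the
   one-entry case: a single path (the key) and its target (the value). */
typedef struct demo_targets_t {
  size_t nelts;
  const char *path;
  demo_lock_target_t target;
} demo_targets_t;

/* Build a one-entry target set so a caller that wants to lock a single
   path does not have to construct the container by hand.  PATH is
   borrowed, not copied. */
demo_targets_t demo_targets_singleton(const char *path, long current_rev)
{
  demo_targets_t targets;

  assert(path != NULL);
  targets.nelts = 1;
  targets.path = path;
  targets.target.current_rev = current_rev;
  return targets;
}
```

A caller would write something like
demo_targets_singleton("/trunk/README", 42) and pass the result straight
to the multi-path function.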

> I started reading further through the new code. Here are some more
> review comments.
>
> The 'comment' parameter to svn_fs_lock2 should be a member of the
> 'target' struct, since it is inherently a per-target attribute. Of
> course there are common scenarios where a client wants to lock a bunch
> of paths all with the same comment, but the benefits of a lock-many
> API should also be made available to clients which supply different
> comments. This applies all the way up the call stack. However, I note
> that from the RA layer upwards we have had lock-many APIs since v1.2
> which take a single comment for all the paths, so changing to
> per-target comments throughout the stack is perhaps beyond the scope
> of this issue. We could still do it at the FS and repos layers now in
> preparation; I don't see that it would add significant overhead in
> run-time or in maintenance.

I think per-lock comments in a multi-path lock are unnecessary. I don't
think anybody locking dozens/hundreds of files will bother defining
different comments. Anyone who wants two or three locks with distinct
comments can call the function multiple times.

> In principle, the 'is_dav_comment' and 'expiration_date' parameters
> should similarly be per target, but that makes less sense in practice
> as they're only used for generic DAV clients via mod_dav_svn. As an
> alternative, we might consider dropping them from this API and keeping
> the original single-lock API (not deprecated) to cater for such
> locks. RA-local and svnserve and 'svnadmin lock' don't support
> expiration and always pass is_dav_comment=FALSE. Only mod_dav_svn
> supplies these two options, and it currently only uses the old
> one-lock-at-a-time API anyway.

Eventually we will define a new request and mod_dav_svn will start
calling the multiple path version. It will still handle the current
LOCK request but may use the multi-path function to do it.

> We should provide a destructor for the hash-of-svn_fs_lock_result_t
> results, which contains errors that need to be cleared. svnserve's
> lock_many() already contains such code twice. It's simple code --
> around 5 lines -- but if we're demanding that callers ensure they do
> it, it's nice to provide a ready-made way for them to do it. This
> would also help in future-proofing against us revising the API in
> later releases.

I'll have a look at that.
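
Roughly, such a destructor only has to walk the results and clear each
error. The sketch below shows the idea with hypothetical stand-in types
(demo_error_t, demo_lock_result_t) and a plain array instead of
svn_error_t, svn_fs_lock_result_t and an apr_hash_t; the real function
would iterate the hash and call svn_error_clear() on each entry.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical stand-in for svn_error_t. */
typedef struct demo_error_t {
  int code;
} demo_error_t;

/* Hypothetical stand-in for one entry of the results hash. */
typedef struct demo_lock_result_t {
  const char *path;    /* key under which the result would be stored */
  demo_error_t *err;   /* NULL on success, otherwise owned by the result */
} demo_lock_result_t;

/* Allocate an error; it is owned by whoever stores it until cleared. */
demo_error_t *demo_error_create(int code)
{
  demo_error_t *err = malloc(sizeof(*err));

  if (err)
    err->code = code;
  return err;
}

/* Clear every per-path error in RESULTS and return how many were
   cleared.  Cleared slots are reset to NULL, so calling this twice is
   harmless. */
size_t demo_lock_results_clear(demo_lock_result_t *results, size_t nelts)
{
  size_t cleared = 0;
  size_t i;

  assert(results != NULL || nelts == 0);
  for (i = 0; i < nelts; ++i)
    {
      if (results[i].err)
        {
          free(results[i].err);
          results[i].err = NULL;
          ++cleared;
        }
    }
  return cleared;
}
```

Resetting each slot to NULL is a deliberate choice here: it lets a
caller invoke the destructor on every exit path without tracking whether
it has already run.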

> The semantics relating to returning one error per path and an overall
> error needs to be fully documented and/or changed. For example,
> svn_repos_fs_lock() assumes that svn_repos_fs_lock2() has set *results
> to a valid hash if it returns any overall error, while svn_fs_lock()
> assumes that svn_fs_lock2() WON'T have put any error in *results if it
> returns an error overall. These look at least surprising.
>

That's an error in libsvn_fs; libsvn_repos does it right. Fixed in
r1582845.
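
To make the ownership question concrete: a single-path wrapper has to
cope with both an overall error and a per-path error being set at the
same time. The sketch below is one possible policy, not necessarily what
r1582845 does; sketch_error_t and sketch_result_t are hypothetical
stand-ins for svn_error_t and svn_fs_lock_result_t.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical stand-in for svn_error_t. */
typedef struct sketch_error_t {
  int code;
} sketch_error_t;

/* Hypothetical stand-in for the single entry in the results hash. */
typedef struct sketch_result_t {
  sketch_error_t *err;   /* per-path error, NULL on success */
} sketch_result_t;

sketch_error_t *sketch_error_create(int code)
{
  sketch_error_t *err = malloc(sizeof(*err));

  if (err)
    err->code = code;
  return err;
}

/* Reduce (overall error, per-path error) to the single error that a
   one-path wrapper returns.  Takes ownership of both: the overall error
   wins if set, and the per-path error is then freed rather than leaked.
   RESULT->err is always reset to NULL. */
sketch_error_t *sketch_single_path_error(sketch_error_t *overall,
                                         sketch_result_t *result)
{
  sketch_error_t *path_err;

  assert(result != NULL);
  path_err = result->err;
  result->err = NULL;

  if (overall)
    {
      free(path_err);   /* free(NULL) is a no-op */
      return overall;
    }
  return path_err;
}
```

Whichever policy is chosen, the point is that the wrapper must not
assume only one of the two errors can be set.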

> +  /* [JAF] Is that a question to reviewers? It depends. What errors can
> +           svn_fs_lock2 return, and do any of them justify not running the
> +           post-lock hook? Can it even return an error and also create
> +           locks? Its doc string needs to say. */
>    if (err)
>      return svn_error_trace(err);

It's almost always possible for fs functions to return arbitrary errors
due to filesystem permissions, lack of disk space etc. This could
happen part way through creating locks. I'm not sure whether we should
attempt to run the post-hook.

> Index: subversion/svnserve/serve.c
> ===================================================================
> --- subversion/svnserve/serve.c    (revision 1582225)
> +++ subversion/svnserve/serve.c    (working copy)
> @@ -2733,7 +2733,8 @@ static svn_error_t *lock_many(svn_ra_svn
>       an error. */
>    SVN_ERR(must_have_access(conn, pool, b, svn_authz_write, NULL, TRUE));
>
> -  /* Loop through the lock requests. */
> +  /* Loop through the lock requests
> +     ### and do what? */

I just copied that comment from the old code. I probably should have
removed it.

>    for (i = 0; i < path_revs->nelts; ++i)
>      {
>        const char *path, *full_path;
> @@ -2759,6 +2760,8 @@ static svn_error_t *lock_many(svn_ra_svn
>        target->current_rev = current_rev;
>
>        /* We could check for duplicate paths and reject the request? */
> +      /* ### [JAF] Is that a question to reviewers? We could, but I
> +         don't think it's useful to do so. */

I brought up the handling of canonical paths on the dev list, but the
discussion wasn't conclusive from my point of view, so I just
implemented something. Given that FS generally accepts non-canonical
paths, it is hard to know what to do.

>        svn_hash_sets(targets, full_path, target);
>      }
>
> @@ -2767,6 +2770,11 @@ static svn_error_t *lock_many(svn_ra_svn
>
>    /* From here on we need to make sure any errors in authz_results, or
>       results, are cleared before returning from this function. */
> +
> +  /* Check authz access for each target, because ...
> +     ### Why? We don't want svn_repos_fs_lock2() to do this for us ...?
> +         Or, we want to log such errors and it's easier to do so before
> +         than afterwards? */

That's the way servers/repos functions work.

>    for (hi = apr_hash_first(pool, targets); hi; hi = apr_hash_next(hi))
>      {
>        const char *full_path = svn__apr_hash_index_key(hi);
> @@ -2791,7 +2799,8 @@ static svn_error_t *lock_many(svn_ra_svn
>                             0, /* No expiration time. */
>                             steal_lock, pool, subpool);
>
> -  /* The client expects results in the same order as paths were supplied. */
> +  /* Send the results. The client expects results in the same order as
> +     paths were supplied. */
>    for (i = 0; i < path_revs->nelts; ++i)
>      {
>        const char *path, *full_path;
> @@ -2816,6 +2825,16 @@ static svn_error_t *lock_many(svn_ra_svn
>          result = svn_hash_gets(authz_results, full_path);
>        if (!result)
>          /* No result?  Should we return some sort of placeholder error? */
> +        /* ### [JAF] Is that a question to reviewers? Certainly something
> +           has gone wrong. Maybe svn_repos_fs_lock2 returned an error
> +           overall, having processed none or only some paths -- is it
> +           allowed to do so?

It could run out of disk space half way through.

> Breaking here would signal to the client that
> +           something went wrong, because they'll see a too-short response
> +           list, but would also potentially hide information about further
> +           targets that were processed, some of which perhaps were locked
> +           successfully. (Note: we're scanning them here in a different
> +           order from the order in which svn_repos_fs_lock2() processed
> +           them.) */
>          break;
>
>        if (result->err)

-- 
Philip Martin | Subversion Committer
WANdisco // *Non-Stop Data*

Received on 2014-03-28 20:53:37 CET

In reply to: Julian Foad: "Review of lock-many API [was: svn commit: r1577280 [1/3] ...]"