[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Full text search of an svn repository

From: Scott Mohekey <smohekey_at_insightful.co.nz>
Date: 2004-07-27 22:18:28 CEST

I've been following the work at subissue.tigris.org and would like to help
address the area of performing a text search of an svn repository.

I have two usage scenarios that I'd like to explore, that of storing issues in
a subversion repository, and that of using a subversion repository as the
backend for a wiki.

Both scenarios require being able to perform a full text search of the latest
revision, which could easily be achieved via grep over a working repository.
However, for the wiki scenario in particular, it would be desirable to be
able to search over the complete revision history. This could also be
achieved via grep, however would require (as far as I can tell) having a
working repository of each revision (or at least each changeset).

In the case of searching over all revisions (or a range there of), there are a
number of different types of results that could be desired. Matches for a
given file could be returned for all matching revisions, the first matching
revision, the last matching revision, or some other combination of the
available matches. These could be in a textual form, or more preferably, an
xml form.

As far as I can tell, this sort of searching would need to be performed on the
subversion database itself. Am I right in this assumption?

Is there anyone else that has attempted anything like this, or can anyone tell
me where I should start looking in the source code? Where is the best place
for me to look for information on the database format of subversion? I've
already read a good deal of the Berkeley DB manuals.

Thanks for your time,

Scott Mohekey.

To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Tue Jul 27 10:17:50 2004

This is an archived mail posted to the Subversion Dev mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.