On 12/3/2010 10:07 AM, Brian Brophy wrote:
> I realize this is not directly related to SVN itself; however, I am
> hoping this community may have some suggestions. If there is a more
> appropriate forum I should be engaging, please let me know.
>
> We have a repo with over 125 GB of data, containing everything from
> source code to requirements documents, etc. The repo is accessible via
> https. We'd like to be able to search the contents of the repo. A typical
> use case would be taking a phrase of interest and finding every place in
> the repo where that text occurs (i.e., within the actual source code or
> documents).
>
> I have considered pointing something like a search appliance at the
> https interface and letting it crawl/spider/index the data. That could
> be one option.
>
> And yes, one could check out/update the repo to search it, but at 125 GB
> and growing, that is a cumbersome way to run many client-side searches.
>
> Would anyone have any other options? Has anyone done something similar?
The commercial FishEye product does this:
http://www.atlassian.com/software/fisheye/
It has some other features as well, but it's not free.
You might be able to roll some kind of search out of htdig, perhaps
pointing it at ViewVC instead of the Subversion view.
If you are mostly interested in the head version and mostly in code, you
can point OpenGrok:
http://hub.opensolaris.org/bin/view/Project+opengrok/WebHome
at a checked-out copy of a project. It will give you a search with
linked cross-references between code definitions and references, plus
some support for accessing the version log.
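If you just want to see what searching a checked-out copy involves before
setting up one of these tools, here is a rough Python sketch. It is not how
any of the products above work. It does a plain case-insensitive text scan
of a working copy and skips the .svn metadata directories. The function
name search_working_copy and the skip_dirs parameter are made up for
illustration. Binary documents (Word, PDF and the like) will not match
usefully without a text extractor, and a linear scan will be slow on 125 GB,
which is the reason a real indexer is the better choice.

```python
import os


def search_working_copy(root, phrase, skip_dirs=(".svn",)):
    """Yield (path, line_number, line) for each line containing phrase.

    Hypothetical helper: a linear, case-insensitive scan of a checked-out
    working copy. It builds no index, so every search rereads every file.
    """
    needle = phrase.lower()
    for dirpath, dirnames, filenames in os.walk(root):
        # Prune metadata directories in place so os.walk does not descend
        # into them (pristine copies there would produce duplicate hits).
        dirnames[:] = [d for d in dirnames if d not in skip_dirs]
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                # errors="ignore" lets the scan pass over binary files
                # instead of raising a decoding error.
                with open(path, "r", encoding="utf-8", errors="ignore") as fh:
                    for lineno, line in enumerate(fh, start=1):
                        if needle in line.lower():
                            yield path, lineno, line.rstrip("\n")
            except OSError:
                # Unreadable file (permissions, broken link): skip it.
                continue
```

Something like "for path, lineno, line in search_working_copy('/path/to/wc',
'some phrase'): print(path, lineno, line)" would list the hits, where the
path is whatever working copy you have checked out.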
--
Les Mikesell
lesmikesell_at_gmail.com
Received on 2010-12-03 17:26:17 CET