[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: Search subversion binary content

From: C. Michael Pilato <cmpilato_at_collab.net>
Date: 2005-10-10 13:21:05 CEST

"Daniel L. Rall" <dlr@finemaltcoding.com> writes:

> > I really need to upload the lucene/libsvn_fs hookup I did in python.
> > It actually scans and indexes all repository history. Doesn't really
> > work on binary files, though. ;-)
>
> Yeah, but you got it -- this is exactly the type of crawler I was referring
> to. Wrapping access -- the "visit" -- to each piece of content in the
> appropriate library (e.g. OLE, PDF, etc.) which can turn it into textual
> data would allow for indexing, and thus allow for searches of that index.

Yep. Man, I'd love to unleash Stellent's Outside In(tm) document
strainers (which Ben and I used to write/support) on a Subversion
indexing engine...

-- 
C. Michael Pilato <cmpilato@collab.net>
CollabNet   <>   www.collab.net   <>   Distributed Development On Demand
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscribe@subversion.tigris.org
For additional commands, e-mail: dev-help@subversion.tigris.org
Received on Mon Oct 10 13:23:03 2005

This is an archived mail posted to the Subversion Dev mailing list.