[svn.haxx.se] · SVN Dev · SVN Users · SVN Org · TSVN Dev · TSVN Users · Subclipse Dev · Subclipse Users · this month's index

Re: stopping webcrawlers using robots.txt

From: Todd D. Esposito <Todd_at_ToddEsposito.com>
Date: 2006-07-10 01:44:50 CEST

Thomas,

Try inserting something like what I've indicated below, inline with your
vhost block:

On Sun, July 9, 2006 13:02, Thomas Beale said:
>
> Hi,
>
> I have looked around but not found the answer to the question: how to
> make /robots.txt visible in an apache virtual host config for a
> subversion server. How would I tell Apache to allow requests to read
> /robots.txt given the following configuration? (Or - how can I just
> block robots going into the SVN repositories)?
>
> <VirtualHost 1.2.3.4>
> ServerAdmin webmaster@xxxx.org
>
> ServerName svn.xxxx.org

Alias /robots.txt /some/non/svn/path/robots.txt
<Location /robots.txt>
  SetHandler default-handler
</Location>

>
> <Location />
> DAV svn
> SVNParentPath /usr/local/var/svn
>
> # authorisation
> AuthzSVNAccessFile /etc/subversion/access-control
>
> # authentication
> AuthType Basic
> AuthName "development Subversion Repository"
> AuthUserFile /etc/subversion/authentication
>
> # anonymous access rules
> Satisfy Any
> Require valid-user
> </Location>
> </VirtualHost>

That should do it, but be warned I didn't test it before sending it off;
YMMV.

Todd D. Esposito
www.Turtol.com -- Web Applications and Hosting
Todd@Turtol.com
Todd@ToddEsposito.com

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscribe@subversion.tigris.org
For additional commands, e-mail: users-help@subversion.tigris.org
Received on Mon Jul 10 01:46:49 2006

This is an archived mail posted to the Subversion Users mailing list.

This site is subject to the Apache Privacy Policy and the Apache Public Forum Archive Policy.