[ZWeb] Zope.org currently unusable
    Mark Pratt 
    mark at zopemag.com
       
    Thu Mar 10 10:38:31 EST 2005
    
    
  
Jens,
You are correct that the crawl-delay parameter is not part of the spec.
There are plenty of examples where specs don't keep up with the times, 
and there is no harm done in using that directive.
Worrying about a robots.txt file is a bit over the top :-)
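
For anyone who wants to check how a crawl-delay line is read in practice, here is a minimal sketch using Python's standard urllib.robotparser module (its crawl_delay() method is available from Python 3.6). The robots.txt content, the delay of 10 seconds, and the helper name delay_for are made up for illustration; they are not what zope.org actually serves.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, for illustration only.
SAMPLE_ROBOTS_TXT = """\
User-agent: Slurp
Crawl-delay: 10

User-agent: *
Disallow: /private/
"""


def delay_for(user_agent, robots_txt=SAMPLE_ROBOTS_TXT):
    """Return the Crawl-delay that applies to user_agent, or None."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.crawl_delay(user_agent)


if __name__ == "__main__":
    for agent in ("Slurp", "Googlebot"):
        print(agent, delay_for(agent))
```

A crawler that does not know the directive simply ignores the line, which is why adding it is harmless.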
Thanks for the link to the reckless/useless user agents page.
Cheers,
Mark
On Mar 10, 2005, at 3:39 PM, Jens Vagelpohl wrote:
>
> On Mar 10, 2005, at 15:27, Andrew Sawyers wrote:
>
>> I need to read up on the robots.txt spec.  Excellent Mark, thanks.
>> Andrew
>
> That piece is not part of the spec. Just like the wildcards that 
> Google claims they use (and I still don't believe that works as 
> advertised). This is the spec:
>
> http://www.robotstxt.org/wc/norobots.html
>
> Here is a robots.txt validator:
>
> http://www.searchengineworld.com/cgi-bin/robotcheck.cgi
>
> Here's a funny one: Someone collected all the reckless/useless user 
> agents for exclusion:
>
> http://www.searchenginegenie.com/Dangerous-user-agents.htm
>
> This one explains Slurp-specific extensions:
>
> http://help.yahoo.com/help/us/ysearch/slurp/slurp-03.html
>
> jens
>