Discussion
bebatjof 🇵🇸
@bebatjof@twoot.site · 2 weeks ago

Okay, so, ahem, is what I did there on line 8 correct? (And why isn't this opt-in instead of opt-out?)

#neocities #mastodonhelp #antiAI

Screenshot
# This is the default rule, which allows search engines to crawl your site (recommended).
User-agent: * 
Allow: / 

# If you do not want AI bots to crawl your site, remove the # from the following lines:
User-agent: AI2Bot
#User-agent: Ai2Bot-Dolma
#User-agent: Amazonbot
[list continues]
bebatjof 🇵🇸
@bebatjof@twoot.site · 2 weeks ago

I mean, this is a bit confusing. I removed the # signs. Should I have moved the whole list to the disallow section? That sounds like it makes a lot of sense, but I'm just following the instructions.

lower end of the list going
User-agent: YouBot
#Disallow: /
Arthur Clemens
@ArthurClemens@todon.nl · 2 weeks ago

@bebatjof The last line is the instruction that will disallow the agents listed above it. Remove the # character to activate it.

See for example https://robotstxt.com/ai

AI / LLM User-Agents: Blocking Guide

Find out how to block your content from being used for AI/LLM training with robots.txt. Created by ex-Google engineer Fili.
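
To make the point about the last line concrete, here is a minimal sketch that checks both variants with Python's standard `urllib.robotparser`. The robots.txt text is abbreviated to the bot names visible in the thread (the real list is longer), and the helper name `may_crawl` is made up for illustration. It only shows how one parser reads the file; whether a given bot actually honours robots.txt is up to that bot.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated robots.txt: only bot names visible in the thread are listed.
# The User-agent lines form one group, and the Disallow line directly
# below them (no blank line in between) applies to every bot in the group.
ROBOTS_BLOCKING = """\
User-agent: *
Allow: /

User-agent: AI2Bot
User-agent: Ai2Bot-Dolma
User-agent: Amazonbot
User-agent: YouBot
Disallow: /
"""

# Same file, but with the final Disallow line still commented out,
# as in the second screenshot.
ROBOTS_NOT_BLOCKING = ROBOTS_BLOCKING.replace("Disallow: /", "#Disallow: /")


def may_crawl(robots_txt: str, user_agent: str, path: str = "/") -> bool:
    """Return True if this parser would let user_agent fetch path."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, path)


if __name__ == "__main__":
    for name, text in [("Disallow active", ROBOTS_BLOCKING),
                       ("Disallow commented out", ROBOTS_NOT_BLOCKING)]:
        for bot in ("AI2Bot", "YouBot", "Googlebot"):
            print(f"{name}: {bot} may crawl = {may_crawl(text, bot)}")
```

With the `Disallow: /` line active, the listed bots are refused while other crawlers still fall under the default `Allow: /` rule. With it commented out, the uncommented User-agent lines have no rule attached, so the listed bots are treated like everyone else.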
