BSD Cafe Announcements
@announcements@mastodon.bsd.cafe  ·  2 days ago

The #BSDCafe #Forgejo instance ("brew") is constantly under "attack" by scrapers. It was killing the performance and making the entire BSD Cafe slower.

I've added some limits and workarounds, but you may still see some 429 errors from time to time.

#BSDCafeUpdates #BSDCafeAnnouncements

Torsten Senf
@sentor@digitalcourage.social replied  ·  2 days ago

@announcements saw #iocaine https://iocaine.madhouse-project.org by @tante a few days ago … maybe it can help

algernon the beaming
@algernon@come-from.mad-scientist.club replied  ·  2 days ago

@announcements FWIW, you can get rid of a lot of those scrapers by catching all the self-identifying ones listed in ai.robots.txt and, on top of that, anything that says Firefox/ or Chrome/ in the user agent but does not also send a sec-fetch-mode header. Catch those, serve them an empty 200, or a 403, or 418, or heck, a 429, and a lot of the bots will be gone.

(Mind you, this also catches Googlebot and Bingbot, which I personally consider a win, but you might want to make an exception for them if you want to appear in search results.)
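The rule above could be sketched roughly like this in Python. This is only an illustration, not what any particular server or tool does: `should_block`, `SELF_IDENTIFYING_BOTS`, and `SEARCH_ENGINE_EXCEPTIONS` are made-up names, the bot names are a small example subset (the real list lives in the ai.robots.txt repository), and the search engine exception assumes you are willing to trust the user agent string.

```python
from typing import Mapping

# Illustrative subset only (assumption); the full list is maintained
# in the ai.robots.txt repository.
SELF_IDENTIFYING_BOTS = ("GPTBot", "ClaudeBot", "CCBot")

# Optional exceptions if you still want to appear in search results.
# This trusts the user agent string, which anyone can spoof.
SEARCH_ENGINE_EXCEPTIONS = ("Googlebot", "bingbot")


def should_block(headers: Mapping[str, str], allow_search_engines: bool = False) -> bool:
    """Return True if the request looks like a scraper under the rule above."""
    # HTTP header names are case-insensitive, so normalise them first.
    normalized = {name.lower(): value for name, value in headers.items()}
    user_agent = normalized.get("user-agent", "")
    ua_lower = user_agent.lower()

    # Let search engine crawlers through if the operator asked for that.
    if allow_search_engines and any(name.lower() in ua_lower for name in SEARCH_ENGINE_EXCEPTIONS):
        return False

    # Rule 1: the bot announces itself in the user agent.
    if any(name.lower() in ua_lower for name in SELF_IDENTIFYING_BOTS):
        return True

    # Rule 2: claims to be Firefox or Chrome but sends no Sec-Fetch-Mode header.
    claims_browser = "Firefox/" in user_agent or "Chrome/" in user_agent
    return claims_browser and "sec-fetch-mode" not in normalized
```

A request for which `should_block(...)` returns True would then get the empty 200, 403, 418, or 429 of your choice. In practice this check would more likely live in the reverse proxy configuration than in application code; the function is just a compact way to show the logic.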

GitHub - ai-robots-txt/ai.robots.txt: A list of AI agents and robots to block.
Stefano Marinelli
@stefano@mastodon.bsd.cafe replied  ·  2 days ago

@algernon @announcements thank you!
