Discussion
Ross Mounce boosted
George Macgregor
@g3om4c@code4lib.social  ·  yesterday

Very helpfully @coar_repositories have published the outcome of the 'Dealing with #Bots Task Group'. Thanks! This includes a toolkit for #repository managers devised around the phases: Preparation, Protection and Mitigation. Essential reading for any institution with one or more repositories -- or indeed any digital library or archive. Let's bust the abusive bots!

Dealing With Bots: A #COAR Resource for Repository Managers
https://dealing-with-bots.coar-repositories.org/ #repositories #DigitalLibraries #LLMs

hannah aubry boosted
Random fediverse bots
@botwikirandomfediverse@stefanbohacek.online  ·  2 weeks ago

SOMEONE TOOK HALF MY SANDWICH. PUT IT BACK NOW. BRENDA IN HR KNOWS WHO YOU ARE.

https://botwiki.org/bot/office-fridge-bot/

#bots #CreativeBots #CreativeCoding #fediverse

Kathleen Fitzpatrick
@kfitz@hcommons.social  ·  2 weeks ago

@paregorios @jimgroom @hello Yikes. That’s horrible. I’m so sorry. And increasingly infuriated.

Tom Elliott
@paregorios@hcommons.social replied  ·  2 weeks ago

@kfitz @jimgroom @hello the "#bots" channel in the Code4Lib Slack is just one disquieting revelation after another

Charlie Stross boosted
Jeff Starr
@perishable@mastodon.social  ·  2 weeks ago

⚡ Update! Ultimate AI Block List 🤖 Version 1.8 blocks 700+ #AI #bots, now available for robots.txt, Apache/.htaccess, Nginx, & plain-text flavor. v1.8 adds 60+ bots & improves wildcard patterns to match even more. 100% FREE + #OpenSource for everyone 😊 https://m0n.co/aibots
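For readers who have not written such a file by hand, here is a minimal sketch of how a robots.txt block group can be generated from a list of user-agent names. It is not taken from the block list itself: the function name `build_robots_txt` is hypothetical and the two crawler names are only illustrative entries.

```python
def build_robots_txt(user_agents):
    """Return robots.txt text that disallows every path for the listed agents.

    Several User-agent lines may share one rule group, so a single
    "Disallow: /" at the end applies to all of them.
    """
    if not user_agents:
        return ""  # nothing to block, so emit no rule group at all
    lines = [f"User-agent: {agent}" for agent in user_agents]
    lines.append("Disallow: /")
    return "\n".join(lines) + "\n"


# Illustrative names only; a real list would be read from a file.
print(build_robots_txt(["GPTBot", "CCBot"]))
```

Keep in mind that robots.txt is advisory: it only affects crawlers that choose to honour it, which is why the list also ships server-level flavors.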

Andreas Wagner
@anwagnerdreas@hcommons.social  ·  last month

@rwg I had blocks in the reverse proxy for hard-coded user agent strings and IP ranges first. This soon stopped helping. Presently I have deployed an Anubis proof-of-work container between the reverse proxy and the backend service, and it was really easy (I am not using any special config). I now get a tenth of the scraping requests compared with before, i.e. about two per minute make it through to the backend, whereas before it was about 20 (a really very rough estimate). If scraping traffic increases again, I'll try and see how easy it is to set up iocaine.
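As an illustration of the first approach mentioned above (hard-coded user agent strings and IP ranges), here is a rough sketch of the check a proxy-side filter might perform. The agent substrings and networks are placeholders (the networks are reserved documentation ranges), and `is_blocked` is a hypothetical helper, not the poster's actual configuration.

```python
import ipaddress

# Placeholder values for illustration only.
BLOCKED_AGENT_SUBSTRINGS = ("examplebot", "badscraper")
BLOCKED_NETWORKS = [
    ipaddress.ip_network("203.0.113.0/24"),
    ipaddress.ip_network("198.51.100.0/24"),
]


def is_blocked(remote_addr, user_agent):
    """Return True if the request matches a blocked agent string or IP range."""
    agent = (user_agent or "").lower()
    if any(fragment in agent for fragment in BLOCKED_AGENT_SUBSTRINGS):
        return True
    address = ipaddress.ip_address(remote_addr)
    return any(address in network for network in BLOCKED_NETWORKS)


print(is_blocked("203.0.113.7", "Mozilla/5.0"))
print(is_blocked("192.0.2.1", "Mozilla/5.0"))
```

The weakness is visible in the code: both lists are static, so a scraper that rotates addresses or copies a browser's user agent string passes straight through.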

But this all is for a backend that is not very robust and tolerant of many parallel requests. It sounds like your system, by contrast, can take a fair bit of scraping traffic. Then, I would either just not do anything or consider iocaine if I felt I just wanted to push back a bit as a matter of principle.
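For context on the proof-of-work idea mentioned above, here is a generic hashcash-style sketch. It is an assumption-laden illustration of the general technique, not Anubis's actual protocol: the challenge string, the difficulty measure (leading zero hex digits of a SHA-256 digest) and the function names are all made up for this example.

```python
import hashlib
from itertools import count


def verify(challenge, nonce, difficulty):
    """Cheap server-side check: one hash, then compare the prefix."""
    digest = hashlib.sha256(f"{challenge}{nonce}".encode()).hexdigest()
    return digest.startswith("0" * difficulty)


def solve(challenge, difficulty):
    """Expensive client-side search: try nonces until one verifies."""
    for nonce in count():
        if verify(challenge, nonce, difficulty):
            return nonce


nonce = solve("demo-challenge", 3)
print(nonce, verify("demo-challenge", nonce, 3))
```

The asymmetry is the point: solving takes many hash attempts while verifying takes one, so the cost is negligible for a single visitor but adds up for a client making thousands of requests.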

Andreas Wagner
@anwagnerdreas@hcommons.social replied  ·  last month

@rwg if you feel like that might be your crowd and platform, there is a nice and busy #bots room on the #Code4lib slack.

SinMisterios
@sinmisterios@mastodon.uy  ·  last month

@j3j5 @santiago Well then, if there's no data, we speculate. I reckon @santiago's most-used tag in 2025 must be #yameiba, and the one for 2026 #arandelas.
Mine must be #puzzle.
@j3j5 yours would be #bots or something like that?

Julio J.
@j3j5@mastodon.uy replied  ·  last month

@sinmisterios @santiago I don't know, I'm going to go with #NowPlaying, which I think I used more this year than #botsgüenos (or #bots).

https://mastodon.uy/tags/botsg%C3%BCenos
Julio J.
@j3j5@mastodon.uy  ·  last month

RE: https://oisaur.com/@renchap/115722957498468670

@sinmisterios @santiago yes, the Mastodon developers didn't manage to get it into an "official" release in time this year. It is supposed to be included in version 4.6, which is due out in March next year, so (if they deliver this time, because last year they said something similar) we'll have it next year.

The CTO explains it a bit more here.

SinMisterios
@sinmisterios@mastodon.uy replied  ·  last month

@j3j5 @santiago Well then, if there's no data, we speculate. I reckon @santiago's most-used tag in 2025 must be #yameiba, and the one for 2026 #arandelas.
Mine must be #puzzle.
@j3j5 yours would be #bots or something like that?

Esther Payne :bisexual_flag: boosted
ansuz / ऐरन
@ansuz@social.cryptography.dog  ·  3 months ago

I just published a blog post summing up my most pertinent thoughts about dealing with badly-behaved web-scraping bots:

https://cryptography.dog/blog/AI-scrapers-request-commented-scripts/

It isn't exactly a Hallowe'en-themed article, but today is the 31st and the topic is concerned with pranking people who come knocking on my website's ports, so it's somewhat appropriate.

#infosec #bots #halloween #scrapers #AI #someMoreHashtagsHere

AI scrapers request commented scripts

A new avenue for identifying greedy, badly-behaved bots
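Going only by the title above, one way such a signal could be used is sketched below: collect script URLs that appear solely inside HTML comments, then flag any client that requests them, since a browser would never fetch a commented-out script. This is a guess at the general idea, not a reproduction of the linked article's method, and the helper names, paths and addresses are invented for illustration.

```python
import re

COMMENT_RE = re.compile(r"<!--(.*?)-->", re.DOTALL)
SRC_RE = re.compile(r"""<script[^>]*\bsrc=["']([^"']+)["']""", re.IGNORECASE)


def commented_script_paths(html):
    """Return the src values of script tags that occur inside HTML comments."""
    paths = set()
    for comment in COMMENT_RE.findall(html):
        paths.update(SRC_RE.findall(comment))
    return paths


def flag_clients(requests, decoy_paths):
    """Return the clients that requested at least one commented-out path.

    `requests` is an iterable of (client_address, requested_path) pairs.
    """
    return {client for client, path in requests if path in decoy_paths}


page = (
    '<html><!-- <script src="/js/old.js"></script> -->'
    '<script src="/js/app.js"></script></html>'
)
decoys = commented_script_paths(page)
log = [("198.51.100.9", "/js/old.js"), ("192.0.2.4", "/js/app.js")]
print(decoys, flag_clients(log, decoys))
```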
Joel Michael boosted
Neil Craig
@tdp_org@mastodon.social  ·  2 months ago

3% of requests to www.bbc.co.uk & www.bbc.com have a `user-agent` of OKHttp.

Part of that 3% is a single IP in Turkey which is making nearly 3 million requests per day using OKHttp on its own.

Being on the internet is fun.

#WebStats #WebDev #BBC #Bots #WhoRuinedTheInternet
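Figures like the ones above can be derived from access logs with very little code. The sketch below assumes the log has already been parsed into `(client_ip, user_agent)` pairs; the sample records, addresses and function names are invented for illustration and say nothing about how the BBC computes its statistics.

```python
from collections import Counter


def user_agent_share(records, needle):
    """Fraction of records whose user agent contains `needle`, ignoring case."""
    if not records:
        return 0.0
    needle = needle.lower()
    hits = sum(1 for _, agent in records if needle in agent.lower())
    return hits / len(records)


def busiest_client(records, needle):
    """Return (client_ip, request_count) for the top client using `needle`."""
    needle = needle.lower()
    counts = Counter(ip for ip, agent in records if needle in agent.lower())
    return counts.most_common(1)[0] if counts else None


sample = [
    ("203.0.113.5", "okhttp/4.9.3"),
    ("203.0.113.5", "okhttp/4.9.3"),
    ("192.0.2.8", "okhttp/3.12.0"),
    ("192.0.2.9", "Mozilla/5.0"),
]
print(user_agent_share(sample, "okhttp"), busiest_client(sample, "okhttp"))
```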

Grégory Gutierez 🌻🎸🐧 boosted
William Lindsey :toad:
@wdlindsy@toad.social  ·  2 months ago

"Musk’s latest feature on X has had some shocking and unintended consequences.

Starting Friday, X users were able to use a new 'about this account' feature to see what country accounts were based in. And for many 'America First' posters, this revealed an inconvenient truth…."

Namely that many of these MAGA accounts operate from Russia, Nigeria, India, Thailand, and other countries….

~ Rachel Kahn

#trolls #MAGA #bots
/1

https://newrepublic.com/post/203562/maga-trolls-elon-musk-x-new-feature

The New Republic

Many Top MAGA Trolls Aren’t Even in the U.S.

Elon Musk’s new X feature has been very revealing.
Codeberg
@Codeberg@social.anoxinon.de  ·  2 months ago

@kkarhan @Erpel However, it's the open Internet we're talking about. As such, I hope that it's understandable that we can't guarantee that we can keep your code 100% safe from getting scraped by third parties. ~n

Kevin Karhan :verified:
@kkarhan@infosec.space replied  ·  2 months ago

@Codeberg my problem ain't #scrapers - otherwise I'd #SelfHost my stuff in my own home LAN - but rather #bots flooding #Issues and #PullRequests with garbage.

  • Cuz I do expect #bots to scrape #FLOSS which I permissively licensed...

The problem I dread is once people start abusing their "#AI" #bullshit and #FloodTheZoneWithShit, aka "#AIslop", for no good reason.

  • Kinda like @bagder had to deal with "AI" slop #SecurityReports that didn't even try to show a #ProofOfConcept or actually evidence their claims in a scientifically reproducible fashion, but merely wasted the lifetime of maintainers!

And @Erpel 's original issue is just that: #spam in the #IssueTracker...

Kevin Karhan :verified:
@kkarhan@infosec.space  ·  2 months ago

@Erpel because the spammers are assholes?

  • I wish @Codeberg had some #AI #bot / #blocklist feature to flat-out block #spammers and #bots, cuz given the fact that #GitHub refuses to allow blocking #Copilot, this may be the straw to move the projects over there if I can't be assed to #SelfHost and merely abuse GitHub as a #CDN!

  • Which I certainly would intend…

travis boosted
Random fediverse bots
@botwikirandomfediverse@stefanbohacek.online  ·  2 months ago

You all know what to do when you hear the magic word, right?

https://botwiki.org/bot/conky-3000/

Follow: https://beep.town/@conky_3000

#bots #CreativeBots #CreativeCoding #fediverse

Hacker News
@h4ckernews@mastodon.social  ·  2 months ago

Messing with Scraper Bots

https://herman.bearblog.dev/messing-with-bots/

#HackerNews #Messing #with #Scraper #Bots #tech #news #web #scraping #security #bots

Herman's blog

Messing with bots

Markov chain babblers, bogus php files, and more!
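For anyone unfamiliar with the "Markov chain babbler" idea mentioned above, here is a small word-level sketch of the general technique: record which words follow which in a source text, then walk those links at random to produce plausible-looking nonsense. The function names and the training sentence are made up, and the linked blog's own implementation may differ.

```python
import random
from collections import defaultdict


def build_chain(text):
    """Map each word to the list of words that follow it in `text`."""
    words = text.split()
    chain = defaultdict(list)
    for current, following in zip(words, words[1:]):
        chain[current].append(following)
    return dict(chain)


def babble(chain, start, length, seed=None):
    """Generate up to `length` words by following random successor links."""
    rng = random.Random(seed)
    word = start
    output = [word]
    for _ in range(length - 1):
        followers = chain.get(word)
        if not followers:
            break  # dead end: the word never had a successor
        word = rng.choice(followers)
        output.append(word)
    return " ".join(output)


chain = build_chain("the bot reads the page and the bot leaves")
print(babble(chain, "the", 8, seed=1))
```

Serving text like this to a scraper costs the server almost nothing, while the scraper spends bandwidth collecting content with no value.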

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate
