#Tag · bonfire.cafe

#Tag

jbz

@jbz@indieweb.social · 2 weeks ago

📰 News publishers limit Internet Archive access due to AI scraping concerns

https://www.niemanlab.org/2026/01/news-publishers-limit-internet-archive-access-due-to-ai-scraping-concerns/

#ai #aiscraping #internetarchive #lostmedia

Nieman Lab

News publishers limit Internet Archive access due to AI scraping concerns

Outlets like The Guardian and The New York Times are scrutinizing digital archives as potential backdoors for AI crawlers.

jbz

@jbz@indieweb.social · 7 months ago

(⁠⌐⁠■⁠-⁠■⁠) Perplexity is using stealth, undeclared crawlers to evade website no-crawl directives

https://blog.cloudflare.com/perplexity-is-using-stealth-undeclared-crawlers-to-evade-website-no-crawl-directives

#perplexity #aisearch #aiscraping

Paolo Amoroso

@amoroso@oldbytes.space · 7 months ago

Scraping for AI training may or may not be legal. But the effort crawlers put into evading detection and blocking is a smoking gun, an admission this scraping is not fair.

https://arstechnica.com/tech-policy/2025/08/ai-industry-horrified-to-face-largest-copyright-class-action-ever-certified

#AIscraping #scrapers #ai

Paul Chambers🚧

@paul@oldfriends.live · 8 months ago

A website appears to be scraping hashtags and creating AI articles, and then replying to the OG post

It stole one of my posts (https://oldfriends.live/@paul/114770093020700675) for its AI created article then spammed me from @s00laiman

It's doing it with #HashTagGames tags and other trending hashtags.

https://www.trend247daily.com/articles

#MastoAdmin

Article created from scraped post: https://www.trend247daily.com/article/mastering-the-art-of-the-productive-day-wake-up-look-busy-go-to-bed

See this thread above, unless the AI content spammer deletes its reply and breaks the thread.

I don't know where it is getting its content, from it's Mastodon Account ( @s00laiman ) account, rss, or the API. If it has an application I would hope @staff and @moderation would shut it down from scraping the API.

#Spam #Fediblock #AIScraping