Discussion
Loading...

Post

  • About
  • Code of conduct
  • Privacy
  • Users
  • Instances
  • About Bonfire
Thib
@thibaultamartin@mamot.fr  ·  activity timestamp 2 months ago

Popular online books archive Anna's Archive is monetizing access to their dataset by selling high-speed access to their collection to LLM vendors, and I don't know what to think about it.

https://annas-archive.org/llm

#llm #ai #ethics

  • Copy link
  • Flag this post
  • Block
Gabriel N
@gnyman@infosec.exchange replied  ·  activity timestamp last month
@thibaultamartin my initial reaction is that it seems kind of pragmatic, most of what they have is anyways available as a torrent

And I bet they are being hammered by scraping bots who doesn't realise this fact or can't download a torrent. Meta is having big problems because they were caught downloading the torrents.

I bet many scraping companies are trying very hard to pretend they don't know what they are ingesting, they had "no idea" they were scraping AA , they just implemented all the AA specific anti-anti-bot things as part of their normal attempts to bypass the wishes of the content producers.

I don't see how paying AA for access would be possible for these, but if AA gets a bit of extra donations then that's a win.

  • Copy link
  • Flag this comment
  • Block
decryption
@decryption@aus.social replied  ·  activity timestamp 2 months ago
@thibaultamartin hmmm this isn’t why I donated to them
  • Copy link
  • Flag this comment
  • Block
Joe Brockmeier (@jzb)
@jzb@mastodon.social replied  ·  activity timestamp 2 months ago
@thibaultamartin I guarantee the cops are only going to go after one party in that transaction.
  • Copy link
  • Flag this comment
  • Block
Log in

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate

bonfire.cafe: About · Code of conduct · Privacy · Users · Instances
Bonfire social · 1.0.0-rc.3.1 no JS en
Automatic federation enabled
  • Explore
  • About
  • Members
  • Code of Conduct
Home
Login