Discussion
Loading...

Discussion

  • About
  • Code of conduct
  • Privacy
  • Users
  • Instances
  • About Bonfire
Charlie Stross
@cstross@wandering.shop  ·  activity timestamp 2 weeks ago

World's smallest violin needed, again:

https://www.hollywoodreporter.com/business/business-news/openai-loses-key-discovery-battle-why-deleted-library-of-pirated-books-1236436363/

"OpenAI Loses Key Discovery Battle as It Cedes Ground to Authors in AI Lawsuits" —if found guilty of 'wilful" infringement OpenAI could be on the hook for up to $150,000 per work. (And circumstances don't look great for them—they appear to have destroyed lists of the copyrighted works they'd trained GPT on.)

  • Copy link
  • Flag this post
  • Block
David Revoy
@davidrevoy@framapiaf.org replied  ·  activity timestamp 2 weeks ago

@cstross $150,000 per work?!
Okay, OpenAI, there are at least 20 pages of my work in your database, and I'm too lazy to do the math at this point. I'll accept if you're too lazy to do the math and decide to round up to $2 million. Just use the IBAN on my blog. Thank you. 😆

A screenshot of the website "Have I been trained" offering a GUI to watch into the LAION database of the AI giants and perform a search on it. On the screenshot: many , many items returned by the search for Pepper&Carrot, my comic.
A screenshot of the website "Have I been trained" offering a GUI to watch into the LAION database of the AI giants and perform a search on it. On the screenshot: many , many items returned by the search for Pepper&Carrot, my comic.
A screenshot of the website "Have I been trained" offering a GUI to watch into the LAION database of the AI giants and perform a search on it. On the screenshot: many , many items returned by the search for Pepper&Carrot, my comic.
  • Copy link
  • Flag this comment
  • Block
Erik Ableson
@erik@mastodon.infrageeks.social replied  ·  activity timestamp 2 weeks ago

@cstross

Two stickers. One with the Pirate Bay logo over top of the text "Training Data" and the second is the phrase "You wouldn't download a training data" in the style of the anti-piracy ads added to the beginning of films
Two stickers. One with the Pirate Bay logo over top of the text "Training Data" and the second is the phrase "You wouldn't download a training data" in the style of the anti-piracy ads added to the beginning of films
Two stickers. One with the Pirate Bay logo over top of the text "Training Data" and the second is the phrase "You wouldn't download a training data" in the style of the anti-piracy ads added to the beginning of films
  • Copy link
  • Flag this comment
  • Block
Dodo
@Dodo_sipping@cupoftea.social replied  ·  activity timestamp 2 weeks ago

@cstross let me guess, how this will end - they will pay a huge amount to big publishers. Not the authors.

  • Copy link
  • Flag this comment
  • Block
Charlie Stross
@cstross@wandering.shop replied  ·  activity timestamp 2 weeks ago

@Dodo_sipping Wrong. (I say this as an author in one of the lawsuits.)

  • Copy link
  • Flag this comment
  • Block
Osma A 🇫🇮🇺🇦
@osma@mas.to replied  ·  activity timestamp 2 weeks ago

If the information regarding the source is destroyed, perhaps the responsible action is to destroy the product, too. That's what food and drug safety would dictate.
@cstross

  • Copy link
  • Flag this comment
  • Block
Galad
@galad@mastodon.social replied  ·  activity timestamp 2 weeks ago

@cstross Given the scale of the theft I genuinely wonder just how any settlement would be distributed.

  • Copy link
  • Flag this comment
  • Block
Charlie Stross
@cstross@wandering.shop replied  ·  activity timestamp 2 weeks ago

@galad It's in progress right now and there's a mechanism in place to distribute most of the money directly to creators. Hint: I'm an affected author, I'm part of the settlement.

  • Copy link
  • Flag this comment
  • Block
Robert
@DataMolecular@mastodon.social replied  ·  activity timestamp 2 weeks ago

@cstross just wait for it, when RICO drops.

  • Copy link
  • Flag this comment
  • Block
Softwarewolf
@faoluin@chitter.xyz replied  ·  activity timestamp 2 weeks ago

@cstross Unfortunately, copyright law has been twisted to mostly benefit the wealthiest publishers. This lawsuit is evil versus evil.

  • Copy link
  • Flag this comment
  • Block
Ooze 𓁟
@Ooze@wirejunkie.net replied  ·  activity timestamp 2 weeks ago

@cstross The genie is out of the bottle and no amount of recompense is going to put it back. Making them open source their model as well as paying restitution seems fitting.

  • Copy link
  • Flag this comment
  • Block
JD
@JDGeoShack@social.vivaldi.net replied  ·  activity timestamp 2 weeks ago

@cstross Best news I've heard in a while. Thanks for sharing, I'm over here laughing.

  • Copy link
  • Flag this comment
  • Block
Wulfy
@n_dimension@infosec.exchange replied  ·  activity timestamp 2 weeks ago

@cstross

It might be the first #legal precedent establishing if #AI could be used as witness /evidence in a court of #law.

#Chatgpt, "raw", without the guardrails pre-prompt.

Prompt: "Tell us the LIKELY list of works used to train your vector tree. Where no specific data exists, conduct lexical and linquistic analysis of the structures to estimate with high degree of likelyhood of works authors" 😁

#promptengineering

  • Copy link
  • Flag this comment
  • Block
ginoputrino
@ginoputrino@mastodon.social replied  ·  activity timestamp 2 weeks ago

@cstross I would be astounded if they have actually destroyed the lists of training material, or even the training data itself.

You need all that if you are going to retrain a new version of your LLM in a different manner, and the "field" is moving so fast it would be corporate malfeasance to destroy your source training data.

This seems like a very transparent lie to the court.

  • Copy link
  • Flag this comment
  • Block
Nicole Parsons
@Npars01@mstdn.social replied  ·  activity timestamp 2 weeks ago

@cstross

Even if OpenAI destroyed the list of copyrighted works they stole to train AI, the AI themselves reveal they contain the results of plagiarism

If you ask an AI to find the source of this quote & it finds it...
"A dark-skinned human with four arms walks toward me across the floor of the club, clad only in a belt strung with human skulls. Her hair forms a smoky wreath around her open and curious face. She's interested in me"

Reasonable inference is kind of cool
https://www.williambarabino.com/blog/2018/07/27/inferences/

  • Copy link
  • Flag this comment
  • Block
thriftwicker
@thriftwicker@mastodon.social replied  ·  activity timestamp 2 weeks ago

@cstross Ooh! Can we go after the music generators next?!

  • Copy link
  • Flag this comment
  • Block
oscarfalcon
@oscarfalcon@mastodon.social replied  ·  activity timestamp 2 weeks ago

@cstross

Fuck Them! I hope they disappear into oblivion...

  • Copy link
  • Flag this comment
  • Block
Arena Cops 🇺🇦✌
@ArenaCops@infosec.exchange replied  ·  activity timestamp 2 weeks ago

RE: https://wandering.shop/@cstross/115623499457379506

Reminder: Stealing copyrighted materials to train AI LLMs without the permission of copyright holders is a new kind of organized crime — organized by a considerable number of ethics-free "Big Tech" corps.

P.S.: Release the unredacted, unmanipulated, not fraudulently Trump-absolving Epstein files!

  • Copy link
  • Flag this comment
  • Block
Porky Nolosdos
@fartnuggets@jorts.horse replied  ·  activity timestamp 2 weeks ago

@cstross an organization that operates on a foundation of theft is organized crime.

  • Copy link
  • Flag this comment
  • Block
Fred
@fcbsd@hachyderm.io replied  ·  activity timestamp 2 weeks ago

@cstross isn't the CTO of OpenAI on record saying they needed copyrighted material for their models to be useful - sounds pretty 'wilful' infringement to me...

  • Copy link
  • Flag this comment
  • Block
a colony of eukaryotes
@DogBiscuitUK@mastodon.scot replied  ·  activity timestamp 2 weeks ago

@cstross The only course available - if they have indeed destroyed lists of training material - is to assume all works, without exception, were on those lists.

  • Copy link
  • Flag this comment
  • Block
Flaming Cheeto
@PizzaDemon@mastodon.online replied  ·  activity timestamp 2 weeks ago

@cstross

Tardigrade playing a violin
Tardigrade playing a violin
Tardigrade playing a violin
  • Copy link
  • Flag this comment
  • Block
Log in

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate

bonfire.cafe: About · Code of conduct · Privacy · Users · Instances
Bonfire social · 1.0.1-alpha.8 no JS en
Automatic federation enabled
  • Explore
  • About
  • Members
  • Code of Conduct
Home
Login