Discussion
Râu Cao ⚡ boosted
Anais
@anaiscrosby@infosec.exchange  ·  yesterday

A small number of samples can poison #LLMs of any size

"Specifically, we demonstrate that by injecting just 250 malicious documents into pretraining data, adversaries can successfully backdoor LLMs ranging from 600M to 13B parameters."

cc @asrg

https://www.anthropic.com/research/small-samples-poison

#DestroyAI

A small number of samples can poison LLMs of any size

Anthropic research on data-poisoning attacks in large language models
bonfire.cafe

A space for Bonfire maintainers and contributors to communicate
