
Râu Cao ⚡ boosted
A small number of samples can poison #LLMs of any size
"Specifically, we demonstrate that by injecting just 250 malicious documents into pretraining data, adversaries can successfully backdoor LLMs ranging from 600M to 13B parameters."
cc @asrg