A small number of samples can poison #LLMs of any size
"Specifically, we demonstrate that by injecting just 250 malicious documents into pretraining data, adversaries can successfully backdoor LLMs ranging from 600M to 13B parameters."
cc @asrg