Training experimental models to behave like a MAGA idiot, so that they learn how to suppress the worst 'persona' traits in LLMs.
Unfortunately you can't do this with people.
https://www.anthropic.com/research/persona-vectors
Training experimental models to behave like a MAGA idiot, so that they learn how to suppress the worst 'persona' traits in LLMs.
Unfortunately you can't do this with people.
https://www.anthropic.com/research/persona-vectors
A space for Bonfire maintainers and contributors to communicate