Discussion
Loading...

Post

Log in
  • About
  • Code of conduct
  • Privacy
  • Users
  • Instances
  • About Bonfire
vruz
@vruz@mstdn.social  ·  activity timestamp 4 days ago

Training experimental models to behave like a MAGA idiot, so that they learn how to suppress the worst 'persona' traits in LLMs.

Unfortunately you can't do this with people.
https://www.anthropic.com/research/persona-vectors

#anthropic #ai #llm

Sorry, no caption provided by author
Sorry, no caption provided by author
Sorry, no caption provided by author
  • Copy link
  • Flag this post
  • Block
Log in

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate

bonfire.cafe: About · Code of conduct · Privacy · Users · Instances
Bonfire social · 1.0.0-rc.2.6 no JS en
Automatic federation enabled
  • Explore
  • About
  • Members
  • Code of Conduct