Discussion
Loading...

Post

  • About
  • Code of conduct
  • Privacy
  • Users
  • Instances
  • About Bonfire
Ulrike Hahn
@UlrikeHahn@fediscience.org  ·  activity timestamp 2 months ago

just heard a really interesting talk by Ruoxi Qi from the University of Hong Kong about bias in LLMs.

They investigated LLMs bias toward WEIRD values by prompting LLMs and comparing their answers to World Values Survey (WVS) data (Haerpfer et
al., 2022). The WVS contains questions about human values and data from large representative samples from different parts of the world.

As expected they found bias toward WEIRD but also bias toward East Asia and Russia, presumably reflecting balance in the training data. In fact, whether a country was rich or not, was the best predictor of bias.

A really nice summary plot of their results from the paper is the Fig. 4 heatmap overlaid with clustering results that plots distance between model distribution and WSV distributions as a measure of value alignment! #cogsci25

https://escholarship.org/content/qt87d9k3tg/qt87d9k3tg.pdf

  • Copy link
  • Flag this post
  • Block
Louis Chartrand
@locha@fediscience.org replied  ·  activity timestamp 2 months ago
@UlrikeHahn Sounds super interesting and also super relevant. Your link gave me a 404, but I think I found it: https://escholarship.org/content/qt87d9k3tg/qt87d9k3tg.pdf
  • Copy link
  • Flag this comment
  • Block
Ulrike Hahn
@UlrikeHahn@fediscience.org replied  ·  activity timestamp 2 months ago
@locha thank you! will go back and edit!!
  • Copy link
  • Flag this comment
  • Block
Log in

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate

bonfire.cafe: About · Code of conduct · Privacy · Users · Instances
Bonfire social · 1.0.0-rc.3.13 no JS en
Automatic federation enabled
  • Explore
  • About
  • Members
  • Code of Conduct
Home
Login