Discussion
Loading...

Discussion

Log in
  • About
  • Code of conduct
  • Privacy
  • Users
  • Instances
  • About Bonfire
Ulrike Hahn
Ulrike Hahn
@UlrikeHahn@fediscience.org  ·  activity timestamp 3 hours ago

@imbl @alexh I have claimed no such thing.

And It is, in fact, wholly immaterial to my concern about this how you want to characterise the agent’s capabilities; what *is* relevant to me is the outcome. You can put “blackmail” and “decide” in whatever scare quotes you find appropriate, it doesn’t to my mind impact the fact that this is an undesirable behaviour which we now have the capacity to roll out at scale.

But for what it’s worth you might find this recent pre-print of interest: https://zenodo.org/records/18231172

Zenodo

What is reasoning anyway? A closer look at reasoning in LLMs

There is a remarkable degree of polarisation in current debate about the capacities of Large Language Models (LLMs). One example of this is the debate about reasoning. Some researchers see ample evidence of reasoning in these systems, while others maintain that these systems do not reason at all. This paper seeks to shed light on this debate by examining the divergent uses of the term reasoning across different disciplines. It provides a simple clarificatory framework for talking about behaviour that highlights key dimensions of variation in how ‘reasoning’ is used across psychology, philosophy and AI. This highlights not just the extent to which researchers are talking past each other, but also that common inferences about model capability that accompany classification decisions are, in fact, far less compelling than they might seem.
  • Copy link
  • Flag this post
  • Block
Ulrike Hahn
Ulrike Hahn
@UlrikeHahn@fediscience.org replied  ·  activity timestamp 3 hours ago

@imbl @alexh by coincidence, I also have a long standing research interest in the notion of burden of proof and its role in argument 😉

https://www.researchgate.net/profile/Ulrike-Hahn-3/publication/227070271_The_Burden_of_Proof_and_Its_Role_in_Argumentation/links/004635274a8deebdff000000/The-Burden-of-Proof-and-Its-Role-in-Argumentation.pdf

  • Copy link
  • Flag this comment
  • Block

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate

bonfire.cafe: About · Code of conduct · Privacy · Users · Instances
Bonfire social · 1.0.2-alpha.27 no JS en
Automatic federation enabled
Log in
  • Explore
  • About
  • Members
  • Code of Conduct