Post · bonfire.cafe

@h4ckernews@mastodon.social · 2 months ago

AI benchmarks are a bad joke – and LLM makers are the ones laughing

https://www.theregister.com/2025/11/07/measuring_ai_models_hampered_by/

#HackerNews #AIbenchmarks #LLMs #AIethics #technews #humor

AI benchmarks hampered by bad science: Study finds many tests don't measure the right things
