I've been pretty hard on the LLM bros, but this truly is innovation (in benchmarketing):
Post
Replies:
2
@slightlyoff Otherwise also called benchmaxxing
@slightlyoff As no-one else seemed to have bothered, I opened https://github.com/FujitsuResearch/FieldWorkArena/issues/1