fancysandwiches
@fancysandwiches@neuromatch.social · 13 hours ago

A coworker is trying to get everyone at work to just vibe code everything, and today in a meeting he said something along the lines of "I know today that we rely on internal expertise, but I don't think we should do that anymore". Buddy, if we can't rely on internal expertise, how the fuck are we supposed to validate the output of these LLMs? We can't trust the LLM to validate itself. It was faking the tests in his PR. It wrote dozens of tests that asserted nothing, but he didn't see a problem with that because the test coverage was higher than average.

jonny (good kind)
@jonny@neuromatch.social replied · 5 hours ago

@fancysandwiches
the ones that flood my notifs have graduated to making fake versions of the objects that reimplement a simpler version of what the real thing does, using only those in the tests, calling them mocks, and insisting that unit tests don't run code in the package because that would make it an integration test

