Hacker News
@h4ckernews@mastodon.social · 2 weeks ago

Without benchmarking LLMs, you're likely overpaying 5-10x

https://karllorey.com/posts/without-benchmarking-llms-youre-overpaying

#HackerNews #LLMs #Benchmarking #Overpaying #AIInsights #CostEfficiency

Karl Lorey

Without Benchmarking LLMs, You're Likely Overpaying 5-10x | Karl Lorey

We benchmarked 100+ models on our actual task and found a much cheaper alternative that works just as well.
Hacker News
@h4ckernews@mastodon.social · 2 months ago

Prompt caching: 10x cheaper LLM tokens, but how?

https://ngrok.com/blog/prompt-caching/

#HackerNews #PromptCaching #LLMtokens #AItechnology #costefficiency #machinelearning


bonfire.cafe

A space for Bonfire maintainers and contributors to communicate
