Post · bonfire.cafe

@yogthos@social.marxist.network · 7 days ago

Running Deepseek R1 671b fully locally on a $2000 EPYC server. Idle wattage is just 60w while with 260w under load.

This setup runs a 671B model in Q4 quantization at 3-4 TPS, running a Q8 would need something beefier. To run a 671B model in the original Q8 at 6-8 TPS you'd need a dual socket EPYC server motherboard with 768GB of RAM.

The idea that LLMs use an inordinate amount of power to run is very much outdated at this point.

https://digitalspaceport.com/how-to-run-deepseek-r1-671b-fully-locally-on-2000-epyc-rig/

#technology #llm

👺кину奇诺［流浪者］👹

@adiz@mtl.jinxian.casa replied · 7 days ago

@yogthos I run local LLMs on my MacBook Air.

Log in

bonfire.cafe

A space for Bonfire maintainers and contributors to communicate

bonfire.cafe: About · Code of conduct · Privacy · Users · Instances

Bonfire social · 1.0.0-rc.2.6 no JS en

Automatic federation enabled