vLLM large scale serving: DeepSeek 2.2k tok/s/H200 with wide-EP
https://blog.vllm.ai/2025/12/17/large-scale-serving.html
#HackerNews #vLLM #large #scale #serving #DeepSeek #tok/s #wide-ep #AI #technology