News

Nvidia Blackwell GPU throughput increases by 33 percent through vLLM software optimization

Friday, December 19, 2025 at 09:59 PM

Nvidia and vLLM have collaborated to optimize inference performance on the Blackwell architecture, achieving a 33% increase in maximum throughput per GPU over a one-month period while reducing costs per token.

Related Companies

Nvidia
Nvidia
NVDA
US