Nvidia Blackwell GPU throughput increases by 33 percent through vLLM software optimization
Friday, December 19, 2025 at 09:59 PM
Nvidia and vLLM have collaborated to optimize inference performance on the Blackwell architecture, raising maximum throughput per GPU by 33% over a one-month period while reducing the cost per token.
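
As a rough illustration of how a throughput gain relates to cost per token, the sketch below assumes the cost of a GPU-hour stays fixed, so cost per token scales inversely with throughput. That assumption and the helper name `cost_per_token_change` are illustrative only and are not taken from Nvidia's or vLLM's reporting; the actual savings depend on pricing and workload.

```python
def cost_per_token_change(throughput_gain: float) -> float:
    """Fractional change in cost per token for a fractional throughput gain.

    Assumption (not from the source): the cost of a GPU-hour is constant,
    so cost per token is proportional to 1 / throughput.
    """
    return 1.0 / (1.0 + throughput_gain) - 1.0


# A 33% throughput gain under the fixed GPU-hour cost assumption.
change = cost_per_token_change(0.33)
print(f"Cost per token changes by {change:.1%}")
```

Under that assumption, a 33% gain in throughput corresponds to roughly a 25% lower cost per token, not 33%, because the same GPU-hour cost is spread over 1.33 times as many tokens.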