News
Nvidia CEO Jensen Huang outlines plans to optimize memory hierarchy across SRAM, HBM, and LPDDR
Tuesday, March 17, 2026 at 11:01 PM
Nvidia CEO Jensen Huang outlined a strategy to optimize the memory hierarchy, specifically the SRAM, HBM, and LPDDR tiers, to improve AI infrastructure performance.
Context
At GTC 2026, Nvidia CEO Jensen Huang introduced a comprehensive strategy to optimize the memory hierarchy for the next generation of agentic AI. The centerpiece of this roadmap is the Vera Rubin platform, which pairs custom Olympus cores with a tiered memory architecture. By using NVLink-C2C with 1.8 TB/s of coherent bandwidth, Nvidia is tightening the connection between SRAM, HBM4, and LPDDR5X to eliminate bottlenecks in KV cache management and token generation.
This shift marks a transition from general-purpose data centers to specialized AI factories. Key to this is the new NVIDIA Dynamo operating system and the BlueField-4 STX architecture, which can deliver up to 5x token throughput and 4x energy efficiency. By partnering with memory leaders like SK hynix and Micron, Nvidia aims to scale bandwidth to 1.2 TB/s for its Vera CPU while significantly reducing power consumption. These optimizations are critical for supporting long-context reasoning and real-time agentic workflows scheduled for broad deployment in the second half of 2027.
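The long-context workloads described above tend to be limited by KV-cache traffic rather than raw compute, which is why the memory hierarchy matters so much. A back-of-the-envelope sizing sketch illustrates the scale involved; all model dimensions below are hypothetical, and only the 1.8 TB/s coherent-bandwidth figure comes from the keynote:

```python
# Back-of-the-envelope KV-cache sizing for long-context inference.
# The model shape here is an illustrative assumption, not a published
# spec of any NVIDIA platform.

def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len, bytes_per_elem=2):
    """Bytes needed to hold keys and values for one sequence."""
    # 2x for keys and values, stored per layer, per KV head, per position.
    return 2 * num_layers * num_kv_heads * head_dim * seq_len * bytes_per_elem

# Hypothetical 70B-class model: 80 layers, 8 KV heads (GQA), head_dim 128,
# FP16 cache (2 bytes/element), 128k-token context.
cache = kv_cache_bytes(num_layers=80, num_kv_heads=8, head_dim=128,
                       seq_len=128 * 1024)
print(f"KV cache per sequence: {cache / 2**30:.1f} GiB")

# Decode is bandwidth-bound: each generated token re-reads the whole cache,
# so an upper bound on tokens/sec is memory_bandwidth / bytes_read_per_token.
coherent_bw = 1.8e12  # 1.8 TB/s, the NVLink-C2C figure cited in the keynote
print(f"Decode ceiling from cache reads alone: {coherent_bw / cache:.0f} tokens/s")
```

Even under these rough assumptions, a single 128k-token sequence consumes tens of gigabytes of cache, which is why keeping hot KV data in HBM while spilling colder context to LPDDR is central to the hierarchy Nvidia describes.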
Sources (12)
NVIDIA Launches BlueField-4 STX Storage Architecture With Broad Industry Adoption | NVIDIA Newsroom
Memory Analysis with NVIDIA Nsight Compute | Other 2024
Micron Innovates From the Data Center to the Edge With NVIDIA | Micron Technology
Maximizing GPU Utilization with NVIDIA Run:ai and NVIDIA NIM | NVIDIA Technical Blog
SK hynix Reaffirms Partnership With NVIDIA at GTC 2026, Unveiling Latest AI Memory Portfolio
Scaling the Memory Wall: The Rise and Roadmap of HBM
GTC Keynote with NVIDIA CEO Jensen Huang - Rev
Deep Dive into NVIDIA Groq 3 LPU: A New Choice for AI Inference
Related Companies
Nvidia
NVDA