At the 2026 Nvidia GTC conference, Jensen Huang announced an inference-specific chip, the Groq 3 LPU. The LPU will work in concert with the Rubin GPU to accelerate AI workloads. According to TrendForce's latest findings on AI servers, NVIDIA's high-end AI chip shipment mix is expected to change in 2026. This week, over 30,000 people are descending upon San Jose, Calif., to attend Nvidia GTC, the so-called Superbowl of AI—a. Nvidia's Blackwell Ultra chips, the company's next-generation graphics processor for AI, have been commercially deployed at CoreWeave, the companies announced on Thursday. CoreWeave historically has a close relationship with Nvidia, which owns a stake in the cloud provider. CoreWeave went public. The Rubin platform harnesses extreme codesign across hardware and software to deliver up to 10x reduction in inference token cost and 4x reduction in number of GPUs to train MoE models, compared with the NVIDIA Blackwell platform. NVIDIA Spectrum-X Ethernet Photonics switch systems deliver 5x.
[PDF Version]