Reports
AI-generated structured vendor updates
NVIDIA Rubin Delayed, Blackwell to Account for 71% of High-End GPU Shipments in 2026
NVIDIA has lowered its Rubin GPU production target from 2M to 1.5M units because of HBM4 memory validation delays. TrendForce data show Blackwell's share of high-end GPU shipments rising from 61% to 71% in 2026, consolidating its dominance. Micron exits the Rubin HBM4 supply chain, and SK hynix is to hold a 70% share. Analysts maintain overweight ratings and view the impact as limited. The Rubin delay may extend SK hynix's HBM3E market dominance.
Google Cloud Next '26: Agent Gateway Seizes Control Plane, TPU 8i Locks Inference
At Google Cloud Next '26, Google announces 8th-gen TPUs (8t for training, 8i for inference), Agent Platform with Agent Gateway, Agent Identity, Agent-to-Agent Orchestration, Agentic Data Cloud, and Agentic Defense, which integrates Wiz. The move shifts control from infrastructure to agent orchestration, locking enterprises into a vertically integrated stack.
NVIDIA Licenses Groq LPU Technology: Inference Architecture Shift from HBM to On-Chip SRAM
NVIDIA signs a ~$20B licensing deal with Groq for its LPU technology, which features 230MB of on-chip SRAM with 80TB/s of bandwidth. The design targets the decode phase of Transformer inference, replacing the HBM bottleneck with ultra-low-latency on-chip storage, and could reshape the AI inference chip landscape.
NVIDIA Absorbs Groq LPU: Feynman GPU to Integrate SRAM Inference Tile, Hybrid Architecture by 2028
NVIDIA secures Groq's LPU inference technology via a non-exclusive license and key hires, and plans to integrate large SRAM tiles into its 2028 Feynman GPU using TSMC SoIC hybrid bonding. The integration enables deterministic scheduling and 80TB/s of on-chip bandwidth, shifting NVIDIA from a pure GPU vendor to a vendor of a hybrid inference/training platform.