LPU - AI Infrastructure Intelligence Search

NVIDIA Product Launch High Signal 2026-04-27

NVIDIA Rubin Delayed, Blackwell to Account for 71% of High-End GPU Shipments in 2026

NVIDIA Rubin GPU production target lowered from 2M to 1.5M units due to HBM4 memory validation delays. TrendForce data shows Blackwell share rising from 61% to 71% in 2026, consolidating dominance. Micron exits Rubin HBM4 supply chain, SK hynix to hold 70% share. Analysts maintain overweight ratings, viewing impact as limited. Rubin delay may extend SK hynix's HBM3E market dominance.

Google Other 2026-04-22

Google Cloud Next '26: Agent Gateway Seizes Control Plane, TPU 8i Locks Inference

Google Cloud Next '26 announces 8th-gen TPUs (8t for training, 8i for inference), Agent Platform with Agent Gateway, Agent Identity, Agent-to-Agent Orchestration, Agentic Data Cloud, and Agentic Defense integrating Wiz. The move shifts control from infrastructure to agent orchestration, locking enterprises into a vertically integrated stack.

NVIDIA Other 1970-01-01

NVIDIA Acquires Groq LPU: Inference Architecture Shift from HBM to On-Chip SRAM

NVIDIA signs ~$20B licensing deal with Groq for LPU tech, featuring 230MB on-chip SRAM at 80TB/s bandwidth. This targets Transformer inference decode, replacing HBM bottlenecks with ultra-low latency on-chip storage, potentially reshaping the AI inference chip landscape.

NVIDIA Other 1970-01-01

NVIDIA Absorbs Groq LPU: Feynman GPU to Integrate SRAM Inference Tile, Hybrid Architecture by 2028

NVIDIA secures Groq's LPU inference technology via a non-exclusive license and key hires, planning to integrate large SRAM tiles into its 2028 Feynman GPU using TSMC SoIC hybrid bonding. This enables deterministic scheduling and 80TB/s on-chip bandwidth, shifting NVIDIA from a pure GPU vendor to a hybrid inference/training platform.

Reports

Filter

NVIDIA Rubin Delayed, Blackwell to Account for 71% of High-End GPU Shipments in 2026

Google Cloud Next '26: Agent Gateway Seizes Control Plane, TPU 8i Locks Inference

NVIDIA Acquires Groq LPU: Inference Architecture Shift from HBM to On-Chip SRAM

NVIDIA Absorbs Groq LPU: Feynman GPU to Integrate SRAM Inference Tile, Hybrid Architecture by 2028