Reports
AI-generated structured vendor updates
AMD EPYC Challenges Rack-Scale Density for Agentic AI Control
AMD claims its EPYC processors lead in rack-scale performance for agentic AI's CPU-intensive services (orchestration, caching, databases). Under a 100kW rack model, EPYC 9965 'Turin' delivers 2.37x throughput over NVIDIA Vera, with next-gen 'Venice' projected at 3.30x. Emphasizes deployability on current x86 platforms, avoiding future architecture dependency.
Microsoft Build 2026: Unifying Agent Stack from Chip to Cloud
At Build 2026, Microsoft unveiled a comprehensive agent-era platform: Project Solara (chip-to-cloud), Microsoft IQ (unified grounding), Rayfin (backend generation), Azure HorizonDB, and GPU-accelerated analytics. The goal is to lock developers into Microsoft's ecosystem.
Intel at Computex 2026: 18A, Rackscale, and the Shift to CPU-Centric AI Orchestration
Intel unveils Core Ultra Series 3 on 18A, Xeon 6+ with 288 e-cores, a hybrid local inference orchestrator with Perplexity, rackscale AI infrastructure with Foxconn, and disaggregated inference cloud with SambaNova. The keynote positions the CPU as the central orchestrator for agentic AI, signaling a control plane shift from GPU to x86.
GTC Taipei 2026: Vera 88-Core CPU Designed for Agents, 1.8x x86 Performance
NVIDIA launched first standalone data center microprocessor Vera at GTC Taipei 2026, directly competing with Intel Xeon and AMD EPYC for the first time. 88 custom Olympus Arm cores, monolithic mesh (not chiplet), 50% faster inter-core communication. LPDDR5X 1.2TB/s bandwidth, PCIe Gen6. Agent sandbox 1.8x x86. First customers: OpenAI, Anthropic, SpaceX. Q3 2026 production, FY CPU revenue target $20B. Marks NVIDIA's strategic leap from GPU accelerator vendor to full-stack data center platform vendor.
Intel and SambaNova Rackscale AI: CPU Regains Inference Control Plane
At Computex 2026, Intel unveiled rack-scale AI infrastructure combining Xeon 6+ with SambaNova SN-50 RDUs, plus a fully disaggregated inference cloud (prefill on NVIDIA Blackwell, decode on RDUs) by Vector Core Compute. This aims to reposition the CPU as the central orchestrator for inference, challenging GPU dominance.
NVIDIA Transaction Foundation Models Shift Financial AI Control to Unified GPU Stack
NVIDIA launches a developer example for transaction foundation models, partnering with Revolut, Mastercard, and others to replace siloed ML models with unified transformer-based systems. Leveraging Hopper GPUs, cuDF, and Nemotron, it shifts financial data processing from feature engineering to unified embeddings, effectively moving control to NVIDIA's hardware ecosystem.
Arm-NVIDIA RTX Spark: Tightly Coupled CPU-GPU for Agentic AI PCs
The Arm-based NVIDIA RTX Spark integrates Arm Grace CPU with NVIDIA Blackwell RTX GPU via unified memory, enabling ultra-low latency on-device AI inference for the agentic era. This platform marks a major milestone for Windows on Arm, targeting developers, creators, and gamers.
Arm and NVIDIA RTX Spark: Unified Memory PC Architecture Targets Agentic AI, Encircles x86
Arm and NVIDIA unveil RTX Spark, an Arm-based Grace CPU + Blackwell RTX GPU platform with unified memory, targeting Windows on Arm for agentic AI inference. It delivers 1 Petaflop, reduces token cost, and signals a PC paradigm shift from app-driven to agent-driven, backed by Microsoft.
NVIDIA Vera 88-Core Arm CPU: Control Plane Shifts from x86 to NVIDIA for AI Agent Workloads
NVIDIA unveils Vera, its first standalone datacenter CPU with 88 custom Arm Olympus cores, monolithic mesh, 1.2TB/s LPDDR5X bandwidth, achieving 1.8x x86 performance in agent workloads. Tightly coupled with GPUs via NVLink-C2C, Vera shifts the control plane from Intel/AMD to NVIDIA. First customers: OpenAI, Anthropic. Production Q3 2026.
NVIDIA FOX Blueprint Shifts Factory Control from PLCs to AI Agents on DGX
NVIDIA unveiled the Factory Operations Blueprint (FOX), a reference design for autonomous factory manager agents using NemoClaw, AI-Q Blueprint, and DGX Station (GB300 with 20 PFLOPS FP4, 748GB coherent memory). It unifies live machine signals, quality systems, and robot fleets under an AI decision layer. Foxconn, Pegatron, Advantech, and Wistron are early adopters, projecting 80% faster root cause analysis and 15% labor productivity gains.
NVIDIA Locks Taiwan Supply Chain with AI Factory Stack, Vera Rubin Production Tied to Proprietary Software
NVIDIA partners with TSMC, Foxconn, and others to embed its proprietary AI software (cuLitho, Omniverse, Isaac) into semiconductor manufacturing and server assembly, while ramping Vera Rubin NVL72 production. The move uses efficiency gains (e.g., 20-50% cycle time reduction) as bait to lock the supply chain into a full-stack ecosystem, increasing switching costs for partners.
HPE Launches Vera CPU Server for Agentic AI, Reshaping Server Ecosystem
HPE unveils ProLiant DL394 Gen12 with NVIDIA Vera CPU, purpose-built for agentic AI and reinforcement learning. It offers extreme single-core performance and high memory bandwidth, with HPE iLO security and Compute Ops Management. The platform is validated with Redpanda and NYSE for financial workloads.
NVIDIA BlueField DPU In-Silicon Security Shifts AI Factory Control from Software to Hardware
NVIDIA unveils DOCA security stack (Argus, Vault, Flow) on BlueField-4 DPU, enabling hardware-isolated runtime threat detection via zero-copy memory analysis, zero-trust file access, and 800 Gb/s network enforcement. This shifts security control from host OS to DPU silicon, delivering distributed full-stack protection without compromising AI throughput, but deeply ties to Vera Rubin platform, creating ecosystem lock-in.
NVIDIA Vera CPU: Custom Olympus Core and LPDDR5X Redefine CPU for Agentic AI Factories
NVIDIA unveils Vera CPU with 88 custom Olympus cores, 1.2TB/s LPDDR5X bandwidth, and SCF fabric, targeting CPU execution bottlenecks in agentic AI and reinforcement learning. Claiming 1.8x performance over x86 and memory power under 30W, it shifts AI factory metrics from cores-per-dollar to tokens-per-dollar.
Intel Reclaims AI Control Plane: Xeon 6+ and E835 Target Agentic Orchestration
Intel launches Xeon 6+ (288 E-cores on 18A), E835 200GbE controllers, and Crescent Island GPU. The strategy repositions the CPU as the control plane for agentic AI orchestration and data movement, while using E835 Ethernet to standardize AI data center networking.
NVIDIA RTX Spark: SoC Seizes PC Control, AI Compute Revolution with Ecosystem Lock-in
NVIDIA launches RTX Spark SoC, integrating Blackwell GPU with 20-core Grace CPU (MediaTek co-designed), NVLink-C2C at 600GB/s, up to 128GB unified memory, 1 petaflop FP4 AI, and local 120B-parameter LLM support. This marks a shift from GPU vendor to platform provider, directly challenging Apple M, Qualcomm, and x86 incumbents.
NVIDIA Blackwell Ultra GB300 NVL72: 1.44 EFLOPS FP4, 50x AI Factory Boost
NVIDIA launches Blackwell Ultra GB300 NVL72 rack system with 72 Blackwell Ultra GPUs and 36 Grace CPUs, delivering 1,440 PFLOPS FP4 sparse, 20TB HBM3e, 130TB/s NVLink. Claims 50x AI factory output over Hopper. Available now.
NVIDIA's Triple Play: Vera CPU, N1X Laptop Chip, and $6.5B Silicon Photonics Reshape AI Infra Control
NVIDIA delivers first agent-specific Vera CPU (88 Arm v9.2 cores, 1.2TB/s memory bandwidth), teases consumer N1X laptop chip, and invests $6.5B in silicon photonics. This shifts AI orchestration control from x86 to NVIDIA's Arm ecosystem, while CPO addresses memory wall, but volume production remains challenging until post-2028.
Cisco Scale-Across: Converged Silicon and Optics for Distributed AI Training
Cisco unveils Scale-Across architecture combining Silicon One P200 routing (51.2Tbps) and coherent pluggables (400G/800G ZR/ZR+) with open line systems, enabling deterministic low-latency, lossless connectivity for distributed AI training across data centers separated by tens of kilometers.
NVIDIA Vera CPU Benchmark Crushes x86: Memory Bandwidth Hegemony for Agentic AI
Phoronix benchmarks show NVIDIA Vera CPU with 88 custom Olympus cores (Armv9.2), 1.2TB/s LPDDR5X bandwidth, and 450W TDP outperforming Intel/AMD x86 across agentic AI workloads. It achieves 1.5x overall performance vs 128-core x86, 90% STREAM TRIAD efficiency, and 20-second Linux kernel compilation.