Reports
AI-generated structured vendor updates
Intel at Computex 2026: 18A, Rackscale, and the Shift to CPU-Centric AI Orchestration
Intel unveils Core Ultra Series 3 on 18A, Xeon 6+ with 288 e-cores, a hybrid local inference orchestrator with Perplexity, rackscale AI infrastructure with Foxconn, and disaggregated inference cloud with SambaNova. The keynote positions the CPU as the central orchestrator for agentic AI, signaling a control plane shift from GPU to x86.
Intel and SambaNova Rackscale AI: CPU Regains Inference Control Plane
At Computex 2026, Intel unveiled rack-scale AI infrastructure combining Xeon 6+ with SambaNova SN-50 RDUs, plus a fully disaggregated inference cloud (prefill on NVIDIA Blackwell, decode on RDUs) by Vector Core Compute. This aims to reposition the CPU as the central orchestrator for inference, challenging GPU dominance.
HBM Profitability Falls Below DDR5, TrendForce Warns of Multi-Fold Price Surge in 2027
TrendForce reports that HBM per-wafer revenue fell below DDR5 64GB RDIMM in Q1 2026, making HBM less profitable. Suppliers will reallocate capacity, leading to multi-fold HBM4 contract price increases in 2027. Demand from NVIDIA Rubin Ultra and AI ASICs will further tighten supply.
Arm-NVIDIA RTX Spark: Tightly Coupled CPU-GPU for Agentic AI PCs
The Arm-based NVIDIA RTX Spark integrates Arm Grace CPU with NVIDIA Blackwell RTX GPU via unified memory, enabling ultra-low latency on-device AI inference for the agentic era. This platform marks a major milestone for Windows on Arm, targeting developers, creators, and gamers.
Cisco Locks Security Pipeline: Splunk as Central Hub for Firewall and Runtime Telemetry
Cisco integrates Splunk with Cisco Secure Firewall advanced logging and Isovalent Enterprise Platform (eBPF-based Kubernetes runtime visibility), delivering pre-built detections and correlation. This move aims to transform fragmented security telemetry into high-confidence threat signals, deepening lock-in to Cisco's security platform.
NVIDIA Vera CPU: Custom Olympus Core and LPDDR5X Redefine CPU for Agentic AI Factories
NVIDIA unveils Vera CPU with 88 custom Olympus cores, 1.2TB/s LPDDR5X bandwidth, and SCF fabric, targeting CPU execution bottlenecks in agentic AI and reinforcement learning. Claiming 1.8x performance over x86 and memory power under 30W, it shifts AI factory metrics from cores-per-dollar to tokens-per-dollar.
NVIDIA DSX OS: Open Source Software to Seize AI Factory Control Plane
NVIDIA launches DSX OS, an open-source modular software suite for operating AI factories. Components include DSX Exchange, MaxLPS, NICo, NVSentinel, etc., unifying IT/OT, power optimization, and lifecycle management. Claims 40% more GPUs under fixed power, but core relies on NVIDIA proprietary hardware, aiming to lock users into its ecosystem.
Intel Reclaims AI Control Plane: Xeon 6+ and E835 Target Agentic Orchestration
Intel launches Xeon 6+ (288 E-cores on 18A), E835 200GbE controllers, and Crescent Island GPU. The strategy repositions the CPU as the control plane for agentic AI orchestration and data movement, while using E835 Ethernet to standardize AI data center networking.
Nokia 1830 GX Multi-rail OLS: Density and Power Efficiency Redefine AI Scale-Across Economics
Nokia launches the 1830 GX Multi-rail OLS, supporting 4 fiber rails in 1RU (160 rails per 40RU rack) with >60% power reduction per rail. Designed for AI cluster scale-across, it integrates C+L band EDFA, DGE, OCM, and OTDR, delivering 9.6 THz spectrum per fiber and overcoming space/power constraints at ILA sites.
Cisco Scale-Across: Converged Silicon and Optics for Distributed AI Training
Cisco unveils Scale-Across architecture combining Silicon One P200 routing (51.2Tbps) and coherent pluggables (400G/800G ZR/ZR+) with open line systems, enabling deterministic low-latency, lossless connectivity for distributed AI training across data centers separated by tens of kilometers.
NVIDIA Vera CPU Benchmark Crushes x86: Memory Bandwidth Hegemony for Agentic AI
Phoronix benchmarks show NVIDIA Vera CPU with 88 custom Olympus cores (Armv9.2), 1.2TB/s LPDDR5X bandwidth, and 450W TDP outperforming Intel/AMD x86 across agentic AI workloads. It achieves 1.5x overall performance vs 128-core x86, 90% STREAM TRIAD efficiency, and 20-second Linux kernel compilation.
Cisco G300 Intelligent Packet Flow: Hardware-Accelerated AI Networking Breakthrough
Cisco launches Intelligent Packet Flow on Silicon One G300, transforming the fabric into an intelligent system with hardware-accelerated adaptive routing, collective congestion awareness, and telemetry. In 8K-16K GPU clusters, it reduces CCT by 87% vs ECMP, improves JCT by 82%, and unlocks 28% more GPU efficiency.
Intel Core Ultra 3 SoC Replaces Discrete GPUs in Edge Robotics, Slashing TCO
Intel Core Ultra Series 3 SoC integrates CPU, GPU, and NPU to power edge robotics, replacing discrete GPUs. Partners like Sensory AI run multi-agent AI (vision, language, motion) locally, cutting TCO and eliminating cloud latency. This shifts the cost-performance curve for service robots.
AMD Ryzen AI Halo & Max PRO 400: Local 300B Parameter Inference, but Hidden Lock-in and Thermal Limits
AMD launches Ryzen AI Halo developer platform (128GB unified memory, 200B parameter models) and Ryzen AI Max PRO 400 series (first x86 client to run 300B parameter models locally). Unified memory, ROCm optimization, and OEM partnerships aim to shift agentic AI from cloud to local, but shared memory bandwidth and thermal constraints limit real-world throughput.
Google TPU 8t/8i Enables Cross-Datacenter Training, Gemini 3.5 Flash 4x Faster
Google unveils TPU 8t (training) and TPU 8i (inference) with 3x raw compute and 2x perf-per-watt. JAX/Pathways enable distributed training across 1M+ TPUs across sites. Gemini 3.5 Flash delivers 4x output tokens per second vs frontier models. SynthID adopted by OpenAI, Nvidia, Kakao, Eleven Labs.
Cisco Unified Edge: Service Providers' New Ecosystem Bundle for Edge AI Services
Cisco launches Unified Edge platform integrating compute, networking, storage, and security, managed via Intersight, targeting service providers to deploy AI inference at thousands of edge sites. Verizon as early adopter plans to bundle edge capabilities into enterprise connectivity offerings.
Cisco Uses MRC to Push SRv6: A Stealth Power Grab in AI Networking
Cisco claims MRC protocol is built on its SRv6 architecture, highlighting application-driven networking, static routing reliability, and deterministic visibility. This is a strategic move to lock AI networking into Cisco's SRv6 ecosystem, countering NVIDIA's Spectrum-X and Arista's alternatives.
Cisco N9300 Smart Switches Embed Security into AI Data Center Fabric
At ONUG 2026, Cisco unveiled Nexus One architecture and N9300 Smart Switches, embedding L4 segmentation, Hypershield, eBPF-based Live Protect, and DPU-integrated firewall directly into the network fabric. This aims to deliver bottleneck-free security for AI workloads while enabling AI-driven operations via AgenticOps and AI Canvas.
Cisco-AMD Benchmark Shifts AI Fabric Control from GPU to SmartNIC and Switch
Cisco and AMD jointly release benchmarks for AI scale-out fabrics using N9000 800G switches, Pensando Pollara 400 smartNICs, and MI300X GPUs. IBPerf and MLPerf tests show P01/P99 bandwidth near 400Gbps line rate under incast congestion, proving deterministic performance that eliminates GPU stalls.
Arm Reports Record Results, AGI CPU Emerges as New AI Infrastructure Focal Point
Arm reported record FY2026 results with $4.92B revenue and over 20% growth for three consecutive years. The core highlight is the Arm AGI CPU designed for agentic AI, securing over $2B in customer demand and backing from Meta, AWS, Google, and others.