Reports
AI-generated structured vendor updates
NVIDIA AgentPerf Benchmark: Blackwell Ultra Delivers 20x More Agents per Megawatt vs Hopper
NVIDIA and Artificial Analysis unveil AgentPerf, the first benchmark for agentic AI workloads. Results show the GB300 NVL72 platform delivers up to 20x more concurrent agents per megawatt than the HGX H200 when running DeepSeek V4 Pro, using real coding agent trajectories to measure throughput and responsiveness.
Anthropic Locks Regulated Industries via DXC: Claude-Certified Engineers and OASIS Platform as New Control Points
Anthropic forms a global alliance with DXC Technology, training tens of thousands of Claude-certified forward-deployed engineers to embed Claude into mission-critical systems for banks, airlines, and regulated industries. DXC's OASIS platform defaults to Claude, with over 95% of its code generated by Claude, creating deep dependency.
NVIDIA Halos OS: A Certified Safety OS That Seizes Control of Autonomous Driving
NVIDIA introduces Halos OS, a full-stack safety system comprising the ASIL D-certified Halos Core, a standardized Halos SDK, AI guardrails in Halos Applications, and a cloud-based Safety Evaluation Framework. Built on DRIVE Hyperion, it aims to embed safety into L4 robotaxis from the ground up.
Microsoft & NVIDIA RTX Spark Brings 1 Petaflop AI to Windows, Reshaping Local Inference
At Computex 2026, Microsoft unveiled RTX Spark, an Arm-based AI superchip co-developed with NVIDIA and MediaTek, delivering up to 1 petaflop of AI performance and 128 GB of unified memory for running 120B-parameter models locally. The Intel Arc G3 and Qualcomm Snapdragon X2 series also launched, accelerating the Windows AI PC ecosystem.
NVIDIA Locks Local AI Inference Control with DiffusionGemma Parallel Generation
NVIDIA optimizes Google DeepMind's DiffusionGemma open model, which generates 256 tokens in parallel for a 4x speedup over autoregressive models. It achieves 1,000 tokens/sec on H100 and 150 tokens/sec on DGX Spark, running fully locally with no cloud cost. This reinforces the centrality of NVIDIA GPUs in compute-bound local AI inference.
Arm's Neural Dawn: Dedicated Neural Accelerators Redefine Mobile GPU Roadmap
Arm and Sumo Digital unveil Neural Dawn, the first mobile game to use Unreal Engine MegaLights. By integrating dedicated neural accelerators into next-gen Mali GPUs, it delivers desktop-class ray-traced lighting within mobile power limits, signaling a shift from traditional to AI-native graphics pipelines.
NVIDIA's UK Sovereign AI Play: From Chip Vendor to National Infrastructure Controller
NVIDIA partners with the UK government to deploy sovereign AI infrastructure via Isambard-AI (5,400 GH200 superchips) and the Sovereign AI Fund, backing local startups. This move establishes a national AI control plane, locking compute into NVIDIA's ecosystem and bypassing traditional hyperscalers like AWS and Azure.
NVIDIA and LG Build AI Factory: DSX Platform Locks Physical AI Stack
NVIDIA and LG Group jointly build an AI factory leveraging NVIDIA's DSX platform, integrating the Isaac Sim/Lab, Cosmos, and GR00T frameworks for robotics, autonomous driving, data centers, and sovereign AI. LG subsidiaries align cooling, robotics, and sensor components exclusively with NVIDIA, creating a fortified ecosystem.
NVIDIA and Doosan: Full-Stack Physical AI Platform Restructures Industrial Automation
NVIDIA expands its collaboration with Doosan Group to integrate its physical AI stack (Isaac Sim, Cosmos, Jetson Thor) into Doosan Robotics' Agentic Robot OS and to explore AI factory power (SMR, hydrogen fuel cells) and MGX ecosystem PCB materials. This move transforms NVIDIA from a GPU vendor into the central platform for physical AI and AI factory infrastructure, deeply locking in industrial automation partners.
NVIDIA RTX Spark Superchip: Local AI Agents and AAA Gaming Converge in Ultra-Thin Laptops
NVIDIA unveils RTX Spark, a superchip integrating GPU, CPU, and AI acceleration for Windows PCs, delivering ray-traced gaming at 1440p and over 100 fps alongside local AI agent inference. Launched in partnership with KRAFTON, NC, Riot Games, and T1, it debuts in Korean PC Bangs. This marks NVIDIA's strategic pivot from discrete GPUs to personal computing SoCs, targeting the era of personal AI.
Huawei Cloud Launches AICS: Control Plane Shift in the Token Industrialization Era
Huawei Cloud unveils four Agentic Infra products, led by the AICS cluster (100K cards/200 EFLOPS). It integrates NPU-direct CMS memory, CCE VolcanoNext unified scheduling, and AgentSphere security sandbox to create a unified control plane for LLM training and Agent inference, aiming to lock in the full-stack AI infrastructure.
Cloudflare Acquires VoidZero: Capturing Dev Pipeline via Vite Integration
Cloudflare acquires VoidZero, bringing Vite, Rolldown, Oxc and other Rust-native tools into Workers, enabling one-click deploy from local code to global edge. This aims to unify the full dev lifecycle and push intent-based infrastructure provisioning.
Microsoft Build 2026: Unifying Agent Stack from Chip to Cloud
At Build 2026, Microsoft unveiled a comprehensive agent-era platform: Project Solara (chip-to-cloud), Microsoft IQ (unified grounding), Rayfin (backend generation), Azure HorizonDB, and GPU-accelerated analytics. The goal is to lock developers into Microsoft's ecosystem.
Intel at Computex 2026: 18A, Rackscale, and the Shift to CPU-Centric AI Orchestration
Intel unveils Core Ultra Series 3 on 18A, Xeon 6+ with 288 E-cores, a hybrid local inference orchestrator with Perplexity, rackscale AI infrastructure with Foxconn, and a disaggregated inference cloud with SambaNova. The keynote positions the CPU as the central orchestrator for agentic AI, signaling a control plane shift from GPU to x86.
Computex 2026: Qualcomm Dragonfly Data Center Brand Launch
In the opening keynote at Computex 2026, Qualcomm CEO Amon declared 2026 the Year of Agents and introduced the Compute Continuum concept, in which cloud and edge converge into a unified system. Qualcomm launched its data center business brand, Dragonfly, with details to follow at a June investor day. The launch completes Qualcomm's coverage from milliwatt wearables to data centers. The Snapdragon C platform targets sub-$700 entry-level laptops. Amon emphasized that the agent era requires entirely new device designs.
Cisco Live 2026: AI Defense Upgrades with Policy Studio, Adaptive Red Teaming, Agent Supply Chain Security
At Cisco Live 2026, Cisco unveiled AI Defense upgrades: adaptive red teaming, Policy Studio for natural language policy, and agent supply chain security with CI/CD integration. It also launched AgenticOps autonomous network operations and native integrations with Amazon Bedrock, Google ADK, and LangChain, aiming to secure multi-framework agent environments.
Intel and SambaNova Rackscale AI: CPU Regains Inference Control Plane
At Computex 2026, Intel unveiled rack-scale AI infrastructure combining Xeon 6+ with SambaNova SN-50 RDUs, plus a fully disaggregated inference cloud (prefill on NVIDIA Blackwell, decode on RDUs) by Vector Core Compute. This aims to reposition the CPU as the central orchestrator for inference, challenging GPU dominance.
NVIDIA Transaction Foundation Models Shift Financial AI Control to Unified GPU Stack
NVIDIA launches a developer example for transaction foundation models, partnering with Revolut, Mastercard, and others to replace siloed ML models with unified transformer-based systems. Leveraging Hopper GPUs, cuDF, and Nemotron, it shifts financial data processing from feature engineering to unified embeddings, effectively moving control to NVIDIA's hardware ecosystem.
Arm and NVIDIA RTX Spark: Unified Memory PC Architecture Targets Agentic AI, Encircles x86
Arm and NVIDIA unveil RTX Spark, a platform that pairs an Arm-based Grace CPU with a Blackwell RTX GPU and unified memory, targeting Windows on Arm for agentic AI inference. It delivers 1 petaflop of AI performance, reduces token cost, and signals a PC paradigm shift from app-driven to agent-driven computing, backed by Microsoft.
Qualcomm Unveils Dragonfly Data Center Brand, Arm-Based Compute Targets Enterprise AI Inference
Qualcomm announces Dragonfly, its new data center brand, at Computex 2026, signaling a strategic expansion from mobile to enterprise compute. Built on the Arm architecture, the brand targets low-power AI inference and edge computing. Specific product details will be revealed at an investor day in late June. The company also introduces Snapdragon C, an entry-level platform competing with Apple's MacBook Neo.