Reports
AI-generated structured vendor updates
NVIDIA Unveils Vera CPU for AI Agents, Shifting Control from x86 to Proprietary Silicon
At the annual meeting, Huang announced Vera CPU for AI agents paired with Rubin GPU, claimed Blackwell delivers 30x token throughput over next-best platform, and reiterated CUDA as a moat. This move aims to shift AI compute control from general-purpose CPUs to NVIDIA's proprietary architecture.
MediaTek Lands Exclusive Google TPU v9 Inference Upgrade Triggerfish with 2x SRAM
Google plans a TPU v9 inference upgrade, Triggerfish, exclusively fabbed by MediaTek. It features 2-3x on-chip SRAM, HBM4E DRAM, and a simulation die for local management. Production starts late 2027 with 1-2M units lifecycle, unit price ~30% higher than Humufish.
Intel AI Box Ultra Hits the Road: PC-class Compute Enters Car, Locks Down Edge AI Ecosystem
Intel and Changan Auto launch the AI Box Ultra solution based on the Core Ultra platform, bringing PC-class compute and Android app ecosystem to the cockpit. It emphasizes on-device AI inference, privacy, and offline capability. The move targets Qualcomm and NVIDIA but hides X86 power/thermal drawbacks.
Google Cloud and Nokia Embed Gemini AI Agents to Seize Network Operations Control Plane
Google Cloud and Nokia partner to embed Gemini AI agents (including Router Agent, Event Triage Agent) into Nokia Assurance Center, launching as SaaS on Google Cloud Marketplace in September 2026. Aiming to reduce troubleshooting time by 50-80%, this marks a fundamental shift from rule-based to AI-driven telco operations.
Nokia and Google Cloud Inject Gemini AI into Network Assurance
Nokia integrates Google's Gemini AI into its Assurance Center, creating six AI agents for event triage, anomaly detection, and remediation. Claims 50-80% reduction in troubleshooting time. The SaaS solution will run on Google Cloud, launching September 2026.
Intel at Computex 2026: CPU as Agentic AI Orchestrator, x86 Reclaims Inference Control
At Computex 2026, Intel unveiled the 288-core Xeon 6+ (Intel 18A) and 3rd-gen Core Ultra, claiming Agentic AI shifts CPU:GPU ratio from 1:8 to 1:1. Partnering with SambaNova and Foxconn for rack-scale inference systems, Intel repositions the CPU as the orchestrator for multi-step AI reasoning, aiming to reclaim control from GPU-centric architectures.
Arm's Self-Designed AGI CPU with Meta: Ecosystem Shift from Licensor to Silicon Vendor
Arm unveils its first self-designed data center CPU, the AGI CPU, with 136 cores on 3nm, purpose-built for agentic AI inference. Co-developed with Meta, which will deploy it across its data centers. Claims 2x rack performance over x86, reducing AI capex by $100B per gigawatt. Signals Arm's shift from IP licensing to direct silicon sales, reshaping ecosystem dynamics.
Google unveils 8th-gen TPU: 3x training speed, 3x SRAM for inference, redefines AI compute TCO
At Cloud Next 2026, Google launched 8th-gen TPU with dual variants: TPU 8t for training (9600 per pod, 2PB shared memory) and TPU 8i for inference (1152 per pod, 3x on-chip SRAM). Also announced Gemini Enterprise Agent Platform, N4 Axion ARM instances (2x price-performance vs x86), and AI-driven security with Wiz.
Google Antigravity 2.0 Replaces IDE with AI Agents, Forces Gemini CLI Migration
Google launches Antigravity 2.0, a revolutionary AI coding platform with desktop app, CLI, SDK, and Managed Agents API. It forces migration from Gemini CLI to Antigravity CLI, introduces Gemini Spark personal AI agent running on Google Cloud VM, and upgrades coding assistance from editor feature to software labor operating system.
Arm AGI CPU Demand Doubles, Targets AI Inference Control, Threatens x86 Dominance
Arm doubled its demand forecast for its first in-house datacenter CPU, the AGI CPU, projecting over $2B revenue in FY2027-2028. The 136-core, 3nm Neoverse V3-based chip targets agentic AI inference, claiming 2x rack-level performance over x86. Meta is a key partner; OpenAI, Cloudflare also onboard. This marks Arm's strategic pivot from IP licensor to direct silicon vendor.
Qualcomm Launches Dragonfly Datacenter Brand, ARM AI Chips Target Intel, AMD, NVIDIA
Qualcomm announced Dragonfly datacenter brand at Computex 2026, including custom ASICs, standard CPUs, and dedicated AI accelerators, extending computing from edge to cloud. First ASIC shipments moved up to 2026. Analysts project $3B revenue in FY2027. This marks Qualcomm's formal entry into the datacenter, challenging X86 and GPU ecosystems.
Intel Launches Xeon 6+ with 288 Cores, Reclaims AI Control Plane
Intel unveils Xeon 6+ (288 E-cores, 576MB L3, 18A process), Ethernet 800 E835 controller (200GbE), and next-gen GPU Crescent Island at Computex 2026. Partnerships with SambaNova and Foxconn for rack-scale AI. Strategy: Xeon as the control plane for Agentic AI.
Cloudflare Targets Rule of 50: Agentic AI Traffic Reshapes Edge Control Plane
Cloudflare raised its long-term financial target from Rule of 40 to Rule of 50 at its Investor Day, and acquired VoidZero, the creator of Vite. This move explicitly positions the Workers platform as the default deployment environment for AI agents, capitalizing on the inflection point where bot/AI traffic now exceeds human traffic, and capturing value by controlling the edge compute layer.
Google and XREAL Launch Android XR Smart Glasses: AI Platform Control Shift Intensifies
Google and XREAL launch Project Aura, the first XR glasses running Android XR, Qualcomm's Reality Elite chip, and Gemini AI. This move aims to capture OS control in spatial computing via an open platform and AI integration, challenging Apple and Meta's closed ecosystems.
CrowdStrike Seizes AI Agent Identity Control Plane with Continuous Authorization
CrowdStrike launches Continuous Identity for AI Agents, leveraging SGNL acquisition, to replace static permissions with real-time, risk-based authorization via SPIFFE standards, positioning Falcon as the identity control plane for agentic enterprises.
Ericsson Abandons Full-Stack for Modular DMP, IBM watsonx Orchestrate Enters Telco Automation
Ericsson modularizes its Digital Monetization Platform (DMP), cedes CRM front-end to Salesforce, and brings in IBM as system integrator with watsonx Orchestrate multi-agent orchestrator, retreating to software licensing to address 5G ROI pressure.
Google Cloud's OKF v0.1: A Markdown-Based Control Plane for AI Agent Knowledge
Google Cloud introduces Open Knowledge Format (OKF) v0.1, a vendor-neutral Markdown spec for structuring context for AI agents. It represents knowledge as directories of markdown files with YAML front matter, requiring no proprietary services or SDKs, and can be hosted on any file system, targeting enterprise knowledge fragmentation and interoperability.
Anthropic Agent SDK计费独立,AI编程进入生产级工程化
...
OpenAI buys Ona: Control point shifts to persistent AI agent runtime
OpenAI acquires cloud infrastructure startup Ona to integrate its persistent execution environment into Codex, enabling AI agents to run independently for hours or days in enterprise-owned clouds. This addresses security, governance, and audit requirements, signaling OpenAI's shift from model provider to full-stack AI platform.
OpenAI IPO Super-App Pivot: GPT-5.6, Ads Expansion, and Ecosystem Lock-in Risks
OpenAI files IPO, planning to transform ChatGPT into a super-app with coding tools, AI agents, and ads. GPT-5.6 will support 1.5M token context window, while API pricing drops to compete. This marks a shift from model provider to platform ecosystem, raising lock-in concerns for enterprises.