AI Agents - AI Infrastructure Intelligence Search

NVIDIA Other 2026-06-25

NVIDIA Unveils Vera CPU for AI Agents, Shifting Control from x86 to Proprietary Silicon

At the annual meeting, Huang announced the Vera CPU for AI agents, paired with the Rubin GPU, claimed Blackwell delivers 30x the token throughput of the next-best platform, and reiterated CUDA as a moat. The move aims to shift AI compute control from general-purpose CPUs to NVIDIA's proprietary architecture.

MediaTek Other 2026-06-23

MediaTek Lands Exclusive Google TPU v9 Inference Upgrade Triggerfish with 2x SRAM

Google plans a TPU v9 inference upgrade, Triggerfish, exclusively fabbed by MediaTek. It features 2-3x on-chip SRAM, HBM4E DRAM, and a simulation die for local management. Production starts in late 2027, with 1-2M units expected over the product lifecycle and a unit price ~30% higher than Humufish.

Intel Other 2026-06-23

Intel AI Box Ultra Hits the Road: PC-class Compute Enters Car, Locks Down Edge AI Ecosystem

Intel and Changan Auto launch the AI Box Ultra solution based on the Core Ultra platform, bringing PC-class compute and the Android app ecosystem to the cockpit. It emphasizes on-device AI inference, privacy, and offline capability. The move targets Qualcomm and NVIDIA but hides x86 power/thermal drawbacks.

Google Cloud Other 2026-06-23

Google Cloud and Nokia Embed Gemini AI Agents to Seize Network Operations Control Plane

Google Cloud and Nokia partner to embed Gemini AI agents (including Router Agent and Event Triage Agent) into Nokia Assurance Center, launching as SaaS on Google Cloud Marketplace in September 2026. The partners aim to reduce troubleshooting time by 50-80%, marking a fundamental shift from rule-based to AI-driven telco operations.

Nokia Other 2026-06-23

Nokia and Google Cloud Inject Gemini AI into Network Assurance

Nokia integrates Google's Gemini AI into its Assurance Center, creating six AI agents for event triage, anomaly detection, and remediation. Nokia claims a 50-80% reduction in troubleshooting time. The SaaS solution will run on Google Cloud, launching in September 2026.

Intel Other 2026-06-23

Intel at Computex 2026: CPU as Agentic AI Orchestrator, x86 Reclaims Inference Control

At Computex 2026, Intel unveiled the 288-core Xeon 6+ (Intel 18A) and 3rd-gen Core Ultra, claiming Agentic AI shifts CPU:GPU ratio from 1:8 to 1:1. Partnering with SambaNova and Foxconn for rack-scale inference systems, Intel repositions the CPU as the orchestrator for multi-step AI reasoning, aiming to reclaim control from GPU-centric architectures.

Meta Other 2026-06-22

Arm's Self-Designed AGI CPU with Meta: Ecosystem Shift from Licensor to Silicon Vendor

Arm unveils its first self-designed data center CPU, the AGI CPU, with 136 cores on 3nm, purpose-built for agentic AI inference. Co-developed with Meta, which will deploy it across its data centers. Claims 2x rack performance over x86, reducing AI capex by $100B per gigawatt. Signals Arm's shift from IP licensing to direct silicon sales, reshaping ecosystem dynamics.

Microsoft Azure Other 2026-06-22

Google unveils 8th-gen TPU: 3x training speed, 3x SRAM for inference, redefines AI compute TCO

At Cloud Next 2026, Google launched 8th-gen TPU with dual variants: TPU 8t for training (9600 per pod, 2PB shared memory) and TPU 8i for inference (1152 per pod, 3x on-chip SRAM). Also announced Gemini Enterprise Agent Platform, N4 Axion ARM instances (2x price-performance vs x86), and AI-driven security with Wiz.

Google Other 2026-06-22

Google Antigravity 2.0 Replaces IDE with AI Agents, Forces Gemini CLI Migration

Google launches Antigravity 2.0, an AI coding platform with a desktop app, CLI, SDK, and Managed Agents API. It forces migration from Gemini CLI to Antigravity CLI, introduces the Gemini Spark personal AI agent running on a Google Cloud VM, and recasts coding assistance from an editor feature into an operating system for software labor.

ARM Other 2026-06-22

Arm AGI CPU Demand Doubles, Targets AI Inference Control, Threatens x86 Dominance

Arm doubled its demand forecast for its first in-house datacenter CPU, the AGI CPU, projecting over $2B revenue in FY2027-2028. The 136-core, 3nm Neoverse V3-based chip targets agentic AI inference, claiming 2x rack-level performance over x86. Meta is a key partner; OpenAI, Cloudflare also onboard. This marks Arm's strategic pivot from IP licensor to direct silicon vendor.

Qualcomm Other 2026-06-22

Qualcomm Launches Dragonfly Datacenter Brand, ARM AI Chips Target Intel, AMD, NVIDIA

Qualcomm announced its Dragonfly datacenter brand at Computex 2026, covering custom ASICs, standard CPUs, and dedicated AI accelerators, and extending its computing reach from edge to cloud. First ASIC shipments moved up to 2026. Analysts project $3B revenue in FY2027. This marks Qualcomm's formal entry into the datacenter, challenging the x86 and GPU ecosystems.

Intel Other 2026-06-22

Intel Launches Xeon 6+ with 288 Cores, Reclaims AI Control Plane

Intel unveils Xeon 6+ (288 E-cores, 576MB L3, 18A process), Ethernet 800 E835 controller (200GbE), and next-gen GPU Crescent Island at Computex 2026. Partnerships with SambaNova and Foxconn for rack-scale AI. Strategy: Xeon as the control plane for Agentic AI.

Cloudflare Other 2026-06-19

Cloudflare Targets Rule of 50: Agentic AI Traffic Reshapes Edge Control Plane

Cloudflare raised its long-term financial target from Rule of 40 to Rule of 50 at its Investor Day, and acquired VoidZero, the creator of Vite. This move explicitly positions the Workers platform as the default deployment environment for AI agents, capitalizing on the inflection point where bot/AI traffic now exceeds human traffic, and capturing value by controlling the edge compute layer.

Google Other 2026-06-19

Google and XREAL Launch Android XR Smart Glasses: AI Platform Control Shift Intensifies

Google and XREAL launch Project Aura, the first XR glasses running Android XR, Qualcomm's Reality Elite chip, and Gemini AI. This move aims to capture OS control in spatial computing via an open platform and AI integration, challenging Apple and Meta's closed ecosystems.

CrowdStrike Other 2026-06-19

CrowdStrike Seizes AI Agent Identity Control Plane with Continuous Authorization

CrowdStrike launches Continuous Identity for AI Agents, leveraging its SGNL acquisition to replace static permissions with real-time, risk-based authorization via SPIFFE standards, and positioning Falcon as the identity control plane for agentic enterprises.

Ericsson Other 2026-06-17

Ericsson Abandons Full-Stack for Modular DMP, IBM watsonx Orchestrate Enters Telco Automation

Ericsson modularizes its Digital Monetization Platform (DMP), cedes CRM front-end to Salesforce, and brings in IBM as system integrator with watsonx Orchestrate multi-agent orchestrator, retreating to software licensing to address 5G ROI pressure.

Google Cloud Other 2026-06-17

Google Cloud's OKF v0.1: A Markdown-Based Control Plane for AI Agent Knowledge

Google Cloud introduces Open Knowledge Format (OKF) v0.1, a vendor-neutral Markdown spec for structuring context for AI agents. It represents knowledge as directories of markdown files with YAML front matter, requiring no proprietary services or SDKs, and can be hosted on any file system, targeting enterprise knowledge fragmentation and interoperability.

Anthropic Other 2026-06-17

Anthropic Agent SDK Billing Goes Independent, AI Coding Enters Production-Grade Engineering

...

OpenAI Other 2026-06-17

OpenAI buys Ona: Control point shifts to persistent AI agent runtime

OpenAI acquires cloud infrastructure startup Ona to integrate its persistent execution environment into Codex, enabling AI agents to run independently for hours or days in enterprise-owned clouds. This addresses security, governance, and audit requirements, signaling OpenAI's shift from model provider to full-stack AI platform.

OpenAI Other 2026-06-15

OpenAI IPO Super-App Pivot: GPT-5.6, Ads Expansion, and Ecosystem Lock-in Risks

OpenAI files IPO, planning to transform ChatGPT into a super-app with coding tools, AI agents, and ads. GPT-5.6 will support 1.5M token context window, while API pricing drops to compete. This marks a shift from model provider to platform ecosystem, raising lock-in concerns for enterprises.
