Reports
AI-generated structured vendor updates
Intel at Computex 2026: CPU as Agentic AI Orchestrator, x86 Reclaims Inference Control
At Computex 2026, Intel unveiled the 288-core Xeon 6+ (Intel 18A) and 3rd-gen Core Ultra, claiming Agentic AI shifts CPU:GPU ratio from 1:8 to 1:1. Partnering with SambaNova and Foxconn for rack-scale inference systems, Intel repositions the CPU as the orchestrator for multi-step AI reasoning, aiming to reclaim control from GPU-centric architectures.
Arm Doubles AGI CPU Revenue Target, Signaling Pivot from IP Licensor to Direct Silicon Competitor
Arm reported record FY2026 revenue of $4.92B and doubled its AGI CPU revenue forecast to over $2B by 2028. The 136-core, 3nm, 300W processor, co-developed with Meta, targets AI Agent workloads and has attracted OpenAI and major hyperscalers. This marks Arm's strategic shift from IP licensing to direct silicon competition, triggering FTC antitrust scrutiny.
Fortinet FortiAIGate with NVIDIA Shifts AI Security Control to GPU-Accelerated Inline
Fortinet launches FortiAIGate integrating NVIDIA Blackwell GPU and Dynamo inference framework for inline AI workload protection across data center, cloud, and edge. Promises ultra-low latency, multi-tenancy, and data sovereignty compliance.
英伟达五年来首次发债筹集200亿美元
...
NVIDIA携手Coherent扩产AI光互联
...
NVIDIA and Coherent Scale InP Optical Interconnects for AI Rack-Scale Photonic Fabric
NVIDIA invests in Coherent to build a 6-inch InP wafer fab in Texas, dedicated to optical interconnects for AI racks. The Vera Rubin Ultra NVL576 cluster (576 GPUs across 8 racks) demands photonic links; copper is no longer viable. This signals a fundamental shift from copper to photonics in AI fabric.
AMD Critical RCE Vulnerability Disclosed After 124 Days, Sparks AI Infrastructure Security Crisis
Security researcher mr.bruh publicly disclosed a critical remote code execution (RCE) vulnerability in AMD processors after 124 days without a fix, with AMD refusing a $10,000 bounty. The flaw affects AI servers running AMD EPYC and Instinct, likened to a Log4j moment for AI infrastructure, forcing enterprises to reassess chip-level security response and supply chain risk.
MediaTek Doubles AI ASIC Target to $2B, Challenges Broadcom in Data Center Custom Silicon
MediaTek doubles its 2026 AI ASIC revenue target to $2B, leveraging Google hyperscaler deals and the NVIDIA RTX Spark chip (featuring MediaTek's N1X Arm CPU). It aims for 10-15% of the $70-80B custom AI chip market by 2027, directly challenging Broadcom's dominance.
Qualcomm's $8B Tenstorrent Bet: A RISC-V Chiplet Lock-in Play
Qualcomm is in talks to acquire AI chip startup Tenstorrent for $8-10 billion, targeting its RISC-V-based AI accelerators and chiplet technology. This move aims to reduce Arm dependency and bolster data center AI inference capabilities, marking a strategic pivot from mobile to infrastructure.
MediaTek AI ASIC Deal with Google Reshapes Custom Silicon Landscape
MediaTek's landmark ASIC deal with Google for AI infrastructure doubles 2026 revenue target to $2B. Joint N1X CPU with Nvidia for RTX Spark AI PC and potential SpaceX/xAI orders on Intel 14A process signal a strategic pivot from consumer chips to AI custom silicon, challenging Broadcom's dominance.
AMD Zen 6 Venice 256-Core EPYC Claims 3.3x Rack Performance Over NVIDIA Vera, But Estimates Raise Questions
AMD unveils first estimated performance of Zen 6 Venice EPYC (2nm, 256 cores), claiming 3.3x rack-level integer throughput over NVIDIA Vera at 100kW total power. A direct counter to NVIDIA's Arm push, but based on projected estimates, not silicon.
AMD Backs All-Instinct GPU Cloud: TensorWave's $350M Series B Signals NVIDIA Ecosystem Breakout
TensorWave closes $350M Series B led by Magnetar and AMD Ventures at $1.55B valuation. The cloud is exclusively built on AMD Instinct GPUs (MI300X to MI455X), targeting memory-intensive AI workloads to offer a viable alternative to NVIDIA CUDA lock-in and validate ROCm software stack maturity in production.
Microsoft Maia 200 Mass-Produced, Cobalt 200 Previewed: AI Inference Control Shifts to Azure
At Build 2026, Microsoft announced mass production of Maia 200 AI inference chips, preview of Cobalt 200 ARM processors, and the MAI-Thinking-1 reasoning model (35B params). This signals a full-stack vertical integration to reduce NVIDIA dependency and lock Azure AI workloads.
GTC Taipei 2026: DSX Open Source Data Center Platform, 40% More Chips Under Same Power
NVIDIA launched open-source data center software platform DSX at GTC Taipei 2026, providing planning, deployment, and monitoring tool suite. Key advantage: deploy up to 40% more accelerator chips under same power budget. Huang claims zero-cost factory digital twins. Also launched DGX Station for Windows, 748GB unified memory, 20 petaflops FP4, Q4 2026 availability.
Computex 2026: Qualcomm Dragonfly Data Center Brand Launch
Qualcomm CEO Amon defined 2026 as the Year of Agents at Computex 2026 opening keynote, introducing the Compute Continuum concept—cloud and edge converging into a unified system. Launched data center business brand Dragonfly, details at June investor day. Completes Qualcomm's coverage from milliwatt wearables to data centers. Snapdragon C platform targets sub-$700 entry laptops. Amon emphasized the Agent era requires entirely new device designs.
GTC Taipei 2026: Vera 88-Core CPU Designed for Agents, 1.8x x86 Performance
NVIDIA launched first standalone data center microprocessor Vera at GTC Taipei 2026, directly competing with Intel Xeon and AMD EPYC for the first time. 88 custom Olympus Arm cores, monolithic mesh (not chiplet), 50% faster inter-core communication. LPDDR5X 1.2TB/s bandwidth, PCIe Gen6. Agent sandbox 1.8x x86. First customers: OpenAI, Anthropic, SpaceX. Q3 2026 production, FY CPU revenue target $20B. Marks NVIDIA's strategic leap from GPU accelerator vendor to full-stack data center platform vendor.
Intel Unveils Rack-Scale AI Inference with Xeon 6+ and SambaNova RDU, Targeting Agentic Workloads
Intel announces rack-scale AI infrastructure combining Xeon 6+ (288 cores, Intel 18A) and SambaNova SN-50 RDU for agentic inference. Also launches Vector Core Compute cloud with decoupled prefill/decode using Xeon, SambaNova, and NVIDIA Blackwell. Aims to disrupt GPU-centric inference by offering lower TCO and higher density.
NVIDIA DSX: Open-Source Power Orchestration Steals AI DC Control Plane
NVIDIA unveils DSX, an open-source DC platform that enables 40% more accelerators under the same power budget via software-defined power orchestration and digital twin validation. It shifts DC control from hardware to NVIDIA's software stack.
Qualcomm Unveils Dragonfly Data Center Brand, ARM-Based Compute Targets Enterprise AI Inference
Qualcomm announces Dragonfly, its new data center brand at Computex 2026, signaling a strategic expansion from mobile to enterprise compute. Leveraging ARM architecture, the brand targets low-power AI inference and edge computing. Specific product details will be revealed at an investor day in late June. The company also introduces Snapdragon C, an entry-level platform competing with Apple's MacBook Neo.
NVIDIA Vera 88-Core Arm CPU: Control Plane Shifts from x86 to NVIDIA for AI Agent Workloads
NVIDIA unveils Vera, its first standalone datacenter CPU with 88 custom Arm Olympus cores, monolithic mesh, 1.2TB/s LPDDR5X bandwidth, achieving 1.8x x86 performance in agent workloads. Tightly coupled with GPUs via NVLink-C2C, Vera shifts the control plane from Intel/AMD to NVIDIA. First customers: OpenAI, Anthropic. Production Q3 2026.