Reports
AI-generated structured vendor updates
Google TPU 8th Gen Splits Training and Inference Chips, Inflection Point in AI Infra TCO
Google Cloud unveils 8th-gen TPU with separate training (TPU8t) and inference (TPU8i) chips, delivering 3x training pod performance and 80% inference dollar-performance improvement. Vertex AI evolves into Gemini Enterprise Agent Platform, while the Smals sovereign cloud contract validates public sector AI adoption under strict compliance.
MediaTek AI ASIC Deal with Google Reshapes Custom Silicon Landscape
MediaTek's landmark ASIC deal with Google for AI infrastructure doubles 2026 revenue target to $2B. Joint N1X CPU with Nvidia for RTX Spark AI PC and potential SpaceX/xAI orders on Intel 14A process signal a strategic pivot from consumer chips to AI custom silicon, challenging Broadcom's dominance.
ARM's Pivot to Direct AI Chip Sales: From IP Licensor to Silicon Competitor
ARM accelerates its $15B chip revenue goal by shifting from pure IP licensing to direct AI chip sales, disrupting relationships with Qualcomm and Apple, and challenging Nvidia/Intel, signaling a fundamental ecosystem restructuring.
Qualcomm AI200 on AWS: Inference Chip Ecosystem Shifts from Nvidia Singularity to Multi-Alliance
Qualcomm's AI200 inference chip (768GB memory) is slated for broad AWS deployment by 2026, aiming to reduce cloud AI inference costs. This marks Qualcomm's strategic pivot from mobile to cloud, leveraging AWS's custom silicon initiative to challenge Nvidia's inference monopoly and restructure the cloud inference chip ecosystem.
ASML扩大风险投资布局,加强欧洲半导体和深科技生态系统
...
NVIDIA and SK Hynix Lock Down HBM4/5 Roadmap, Cementing Vera Rubin Supply Chain
NVIDIA and SK Hynix sign a multi-year agreement to co-define HBM4 production and HBM5 pre-research for Vera Rubin GPUs. Samsung also enters HBM4 supply as a second source. The deal elevates SK Hynix from vendor to co-developer, potentially creating a de facto memory standard barrier that marginalizes Micron and others.
Google Awards 3M+ TPU Packaging Orders to Intel Foundry, Breaking TSMC's CoWoS Monopoly
Google has awarded Intel Foundry over 3 million units of next-gen TPU advanced packaging orders, leveraging Intel's EMIB technology with production starting in 2028. This marks Intel Foundry's largest external customer win and a pivotal shift in AI chip packaging away from TSMC's CoWoS monopoly.
Cisco Cloud Control & AI Canvas: The Control Point Shifts from Hardware to the AI Decision Plane
At Cisco Live 2026, Cisco launched Cloud Control, an AI-ops platform with agentic workflows, and AI Canvas for human-agent collaboration. The platform leverages Splunk's data fabric and proprietary models trained on 40 years of Cisco data. The Silicon One architecture now unifies campus and cloud switches. This marks a strategic pivot from hardware vendor to AI platform, shifting the control point to the AI decision plane.
Microsoft Maia 200 Mass-Produced, Cobalt 200 Previewed: AI Inference Control Shifts to Azure
At Build 2026, Microsoft announced mass production of Maia 200 AI inference chips, preview of Cobalt 200 ARM processors, and the MAI-Thinking-1 reasoning model (35B params). This signals a full-stack vertical integration to reduce NVIDIA dependency and lock Azure AI workloads.
GTC Taipei 2026: DSX Open Source Data Center Platform, 40% More Chips Under Same Power
NVIDIA launched open-source data center software platform DSX at GTC Taipei 2026, providing planning, deployment, and monitoring tool suite. Key advantage: deploy up to 40% more accelerator chips under same power budget. Huang claims zero-cost factory digital twins. Also launched DGX Station for Windows, 748GB unified memory, 20 petaflops FP4, Q4 2026 availability.
NVIDIA DSX: Open-Source Power Orchestration Steals AI DC Control Plane
NVIDIA unveils DSX, an open-source DC platform that enables 40% more accelerators under the same power budget via software-defined power orchestration and digital twin validation. It shifts DC control from hardware to NVIDIA's software stack.
NVIDIA RTX Spark: SoC Seizes PC Control, AI Compute Revolution with Ecosystem Lock-in
NVIDIA launches RTX Spark SoC, integrating Blackwell GPU with 20-core Grace CPU (MediaTek co-designed), NVLink-C2C at 600GB/s, up to 128GB unified memory, 1 petaflop FP4 AI, and local 120B-parameter LLM support. This marks a shift from GPU vendor to platform provider, directly challenging Apple M, Qualcomm, and x86 incumbents.
NVIDIA's Triple Play: Vera CPU, N1X Laptop Chip, and $6.5B Silicon Photonics Reshape AI Infra Control
NVIDIA delivers first agent-specific Vera CPU (88 Arm v9.2 cores, 1.2TB/s memory bandwidth), teases consumer N1X laptop chip, and invests $6.5B in silicon photonics. This shifts AI orchestration control from x86 to NVIDIA's Arm ecosystem, while CPO addresses memory wall, but volume production remains challenging until post-2028.
Huawei's Tao Law: LogicFolding Bypasses Lithography, 55% Density Gain on Fixed Node
At ISCAS 2026, Huawei's He Tingbo unveiled the Tao Law, replacing geometric scaling with temporal optimization targeting tau (characteristic time). LogicFolding vertically stacks active layers to shorten critical paths, achieving 55% transistor density increase and 41% energy efficiency gain on a fixed node. Kirin 2026 reaches 3.1GHz; Ascend series will adopt LogicFolding. The roadmap projects equivalent 1.4nm density by 2031, fundamentally challenging Moore's Law's lithography dependency.
NVIDIA and Intel Announce $5 Billion Strategic Partnership: New AI Chip Supply Chain Landscape
NVIDIA and Intel announced a $5 billion strategic partnership on September 18, 2025: NVIDIA invests $5 billion for ~4% Intel stake, while Intel customizes x86 CPUs for NVIDIA AI infrastructure and x86 SoCs integrating RTX GPU chiplets for PC products. Through NVLink, the two companies form a coalition of 'AI Computing + NVIDIA CUDA + x86 Ecosystem'. This reshapes the AI chip supply chain landscape with far-reaching implications for AMD and independent chip designers.
Apple-Google Multi-Year Partnership Confirmed: Gemini to Power New Siri
Apple and Google confirm multi-year partnership with Google Cloud as preferred provider. Google is building a custom 1.2 trillion parameter Gemini model for Apple, 8x Apple's current cloud model. Siri will gain Gemini capabilities in 2026 with iOS 27. Privacy architecture unchanged—Gemini runs on Apple-controlled servers with data protection guarantees. Device compatibility limits exclude hundreds of millions of older iPhone users.
Meta Partners with AWS on Graviton
Meta partners with AWS to deploy tens of millions of Graviton5 cores, becoming one of the largest Graviton customers globally.
Google TPU v8 Launches: Single Cluster Breaks 40 ExaFLOPS
Google launches TPU v8 chip with 40+ ExaFLOPS single cluster capacity, supporting millions of concurrent agents, 3x compute density and 2x energy efficiency improvement.
Cerebras Launches IPO with $20B OpenAI Deal
AI chipmaker Cerebras filed for US IPO on Nasdaq with ticker CBRS; secured $20B multi-year deal with OpenAI to deploy 750MW of chips.
TSMC 2026 Outlook: AI Demand Drives 30%+ Revenue Growth, Advanced Process and Packaging Dual Constraints
Behind TSMC's revenue growth forecast is dual logic of 'volume and price both rising': AI chip demand drives shipment growth, advanced process scarcity pushes wafer unit prices up. But A16 process delay is a signal worth watching—even TSMC faces increasing difficulty in advanced process mass production.