Filter

×
Active Filters Clear All
Keyword: AI基础设施 ×
81 Total Reports
1/5 Page
NVIDIA Other 2026-06-12

NVIDIA and SK Hynix Lock Down HBM4/5 Roadmap, Cementing Vera Rubin Supply Chain

NVIDIA and SK Hynix sign a multi-year agreement to co-define HBM4 production and HBM5 pre-research for Vera Rubin GPUs. Samsung also enters HBM4 supply as a second source. The deal elevates SK Hynix from vendor to co-developer, potentially creating a de facto memory standard barrier that marginalizes Micron and others.

AMD Other 2026-06-12

AMD Backs All-Instinct GPU Cloud: TensorWave's $350M Series B Signals NVIDIA Ecosystem Breakout

TensorWave closes $350M Series B led by Magnetar and AMD Ventures at $1.55B valuation. The cloud is exclusively built on AMD Instinct GPUs (MI300X to MI455X), targeting memory-intensive AI workloads to offer a viable alternative to NVIDIA CUDA lock-in and validate ROCm software stack maturity in production.

Intel Other 2026-06-06

Intel Unveils Decoupled Inference Architecture and Xeon 6+, Partners with SambaNova and Foxconn for Rack-Scale AI Infrastructure

At Computex 2026, Intel unveiled three innovations: 1) Rack-scale AI infrastructure with SambaNova/Foxconn (production-ready); 2) World's first decoupled inference demo—Xeon 6 orchestrates, SN40 RDU decodes, Blackwell GPU prefill; Together.ai achieved fastest enterprise inference with MiniMax 2.5; 3) Xeon 6+—first Intel 18A data center CPU, 32U rack delivers 36,864 cores at ~100kW. Agent inference shifts CPU:GPU ratio from 1:4 toward 1:1.

Cisco Product Launch 2026-06-03

Cisco Cloud Control & AI Canvas: The Control Point Shifts from Hardware to the AI Decision Plane

At Cisco Live 2026, Cisco launched Cloud Control, an AI-ops platform with agentic workflows, and AI Canvas for human-agent collaboration. The platform leverages Splunk's data fabric and proprietary models trained on 40 years of Cisco data. The Silicon One architecture now unifies campus and cloud switches. This marks a strategic pivot from hardware vendor to AI platform, shifting the control point to the AI decision plane.

Microsoft Azure Product Launch 2026-06-03

Microsoft Maia 200 Mass-Produced, Cobalt 200 Previewed: AI Inference Control Shifts to Azure

At Build 2026, Microsoft announced mass production of Maia 200 AI inference chips, preview of Cobalt 200 ARM processors, and the MAI-Thinking-1 reasoning model (35B params). This signals a full-stack vertical integration to reduce NVIDIA dependency and lock Azure AI workloads.

NVIDIA Product Launch 2026-05-29

NVIDIA's Triple Play: Vera CPU, N1X Laptop Chip, and $6.5B Silicon Photonics Reshape AI Infra Control

NVIDIA delivers first agent-specific Vera CPU (88 Arm v9.2 cores, 1.2TB/s memory bandwidth), teases consumer N1X laptop chip, and invests $6.5B in silicon photonics. This shifts AI orchestration control from x86 to NVIDIA's Arm ecosystem, while CPO addresses memory wall, but volume production remains challenging until post-2028.

Intel Other 2026-05-25

Intel CEO: AI Inference Flips CPU/GPU Ratio, Multi-Agent Pushes CPU Back to Core

Intel CEO Lip-Bu Tan forecasts AI inference driving CPU/GPU ratio from 1:8 to 1:1 or even 4:1, with Multi-Agent demands (OS scheduling, KV Cache offload, high-concurrency tool calls) elevating CPU from supporting role to lead. NVIDIA Vera, AMD Venice, and Intel 18A CPU mass production confirm a CPU demand super-cycle.

Zscaler Other 2026-05-20

Zscaler's AI-Guardian Shifts Zero Trust Control Plane to Non-Human AI Identities

Zscaler launches Project AI-Guardian with six GSIs to extend Zero Trust to AI agents, introducing AI Protect suite. The core shift treats non-human identities as first-class security principals, enabling granular access control and continuous red-teaming for AI agent ecosystems.

Intel Other 2026-05-16

AI Agent Workloads Trigger Structural CPU Shortage, Arm and AMD Reshape Server Value Chain

AI inference and agent orchestration surge CPU demand, shifting CPU-GPU ratio from 1:8 to 1:1. AMD EPYC lead time 8-12 weeks, Intel Xeon up to 6 months; Arm's 3nm 136-core AGI processor co-developed with Meta/Cerebras/Cloudflare/OpenAI sees demand exceeding 200 billion USD. CPU replaces GPU as the new AI infrastructure bottleneck, with Arm and AMD reshaping the value chain.

NVIDIA Other 2026-05-16

NVIDIA CUDA Heap Overflow Exposes GPU Cloud Isolation Flaw: Driver-Level Security Must Move to Hardware

At Pwn2Own Berlin 2026, a heap overflow in NVIDIA CUDA Toolkit's NVVM compiler (CVE-2026-12839) enabled GPU cloud cross-tenant escape. The attack chain from malicious PTX to driver compromise to host kernel breaks current driver-level isolation, forcing a fundamental security architecture re-evaluation for shared GPU AI infrastructure.

Cisco Other 2026-05-16

Cisco AI Orders Surge to $9B, but SD-WAN Zero-Day for Third Year Reveals Systemic Security Gap

Cisco Q3 FY2026 raises AI infra order target to $9B, yet a CVSS 10.0 authentication bypass zero-day in SD-WAN Controller (CVE-2026-20182) is exploited by the same APT for the third consecutive year. This reveals a systemic gap in Cisco's security engineering as it pivots to AI, and a fundamental flaw in SD-WAN control plane architecture.

NVIDIA Partnership High Signal 2026-05-02

NVIDIA and Intel Announce $5 Billion Strategic Partnership: New AI Chip Supply Chain Landscape

NVIDIA and Intel announced a $5 billion strategic partnership on September 18, 2025: NVIDIA invests $5 billion for ~4% Intel stake, while Intel customizes x86 CPUs for NVIDIA AI infrastructure and x86 SoCs integrating RTX GPU chiplets for PC products. Through NVLink, the two companies form a coalition of 'AI Computing + NVIDIA CUDA + x86 Ecosystem'. This reshapes the AI chip supply chain landscape with far-reaching implications for AMD and independent chip designers.

NVIDIA Technology Update High Signal 2026-05-02

Global GPU Shortage to Persist Until 2027: Core Bottleneck for AI Infrastructure Expansion

Global GPU shortage expected to extend to 2027-2028, rooted in AI data center demand surge, constrained HBM production, CoWoS packaging tightness, and geopolitical risks. NVIDIA Rubin's mass production hindered (target reduced from 2M to 1.5M units), with Blackwell capturing 71% of high-end GPU shipments in 2026. Consumer RTX 5080/5070 Ti priced $200-$500 above MSRP, enterprise AI infrastructure procurement cycles will further extend.

Intel Other 2026-04-29

Intel Q1 Validates CPU:GPU 1:4 Ratio Trend: How Xeon 6 Reshapes TCO Calculation for AI Inference Infrastructure

Intel Q1 validates CPU:GPU ratio recovery from 1:8 to 1:4. Xeon 6 becomes NVIDIA DGX-Rubin CPU. AMX enables CPU to replace entry-level GPUs in inference reducing per-node TCO by 40-60%

Amazon Other 2026-04-21

Amazon Invests $5B in Anthropic, 10-Year $100B Cloud Deal

Amazon invests additional $5B in Anthropic with a 10-year $100B cloud commitment. Claude becomes the cornerstone of AWS Bedrock, directly challenging Microsoft-OpenAI alliance.

Meta Financial News Medium Signal 2026-04-19

Meta's 2026 Strategy: Labor-to-Compute Reallocation at Extreme Scale

Meta's strategic choice represents 'endgame thinking' in AI infrastructure arms race—not how to profit but how to survive. When capex reaches 50%+ of revenue, this is no longer a business decision but survival bet. The 'relative value' of labor costs has undergone fundamental revaluation in the AI era.

Amazon Market Expansion High Signal 2026-04-15

US AI Infrastructure Expansion Stalls: 30%-50% of 16GW Capacity Delayed

The US planned ~16GW data center capacity this year, with 30%-50% expected to face delays or cancellations, only ~5GW actually breaking ground. Power, supply chain, and workforce bottlenecks suppress AI infrastructure deployment.

Cisco Other High Signal 2026-04-10

Cisco Announces Intent to Acquire Galileo, Bolstering AI Observability and Trust

Cisco announces its intent to acquire Galileo, a startup specializing in AI observability. This move aims to deeply integrate observability, reliability, and safety for AI systems into Cisco's technology platform, signaling an expansion from general IT observability to a dedicated trust and assurance layer for AI infrastructure.

Intel Other High Signal 2026-04-09

Intel and Google Deepen Collaboration on CPU and IPU for Heterogeneous AI Infrastructure

Intel and Google announced a multi-year collaboration to advance next-generation AI and cloud infrastructure through aligned Xeon processor roadmaps and expanded co-development of custom ASIC-based IPUs. This reinforces the central role of CPUs in AI system orchestration and the critical value of IPUs in offloading infrastructure tasks to improve efficiency at hyperscale.

Cisco Other Medium Signal 2026-04-09

Cisco Uses Own Retail Stores as Testbed for Unified Data and AI Infrastructure

Cisco is using its branded retail stores as a testbed, employing Splunk as a unified data platform to integrate diverse data streams from Meraki sensors, POS, and video analytics. This moves operations from reactive monitoring to predictive intelligence, validating the integration of its tech stack in physical retail and paving the way for future AI-driven interactive experiences and Wi-Fi 7 deployments.