ARM - AI Infrastructure Intelligence Search

Cisco Other 2026-07-13

Cisco Launches Cloud Control and AgenticOps to Consolidate Network Management

At Cisco Live 2026, Cisco unveiled Cloud Control to unify Meraki, Catalyst Center, Nexus Dashboard, Security Cloud Control, and Splunk, along with AgenticOps for AI-driven network automation. Concurrently, it laid off 471 employees to align with an AI-first strategy, shifting from hardware sales to operational subscriptions and creating vendor lock-in.

TSMC Other 2026-07-13

TSMC CoWoS Capacity to Reach 200k Wafers by 2027, Diversifying from GPU to CPU and ASIC

TSMC targets 200k wpm CoWoS capacity by 2027, narrowing supply-demand gap from 20% to 10%. Customer base diversifies from NVIDIA GPU to include AI server CPUs (MediaTek, AMD) and ASICs (Broadcom). CoPoS panel-level packaging enters pilot production in 2027.

AMD Other 2026-07-10

Towards Feature Complete Triton Support in JAX-Triton â ROCm Blogs

...

NVIDIA Other 2026-07-08

NVIDIA Rigel Core: Single-Threaded CPU as the New Control Plane for Agentic AI

NVIDIA unveils Rosa CPU architecture with custom Rigel core (Arm v9.2), targeting single-threaded performance for Agentic AI workloads, paired with Feynman GPU (1.6nm, 50 PFLOPS) in 2028. This shifts CPU design from core-count scaling to serial-latency optimization, directly challenging AMD EPYC and Intel Xeon dominance.

NVIDIA Other 2026-07-07

NVIDIA Vera CPU获Perplexity/OpenAI/Anthropic/Oracle采用 AI Agent性能验证1.5-1.9x加速

...

NVIDIA Other 2026-07-07

NVIDIA Vera CPU: Max Single-Threaded Performance at Scale for Agentic AI

NVIDIA launches Vera CPU, a max single-threaded CPU at scale for agentic AI. With Olympus cores delivering 1.8x sustained per-core performance over x86, 1.2TB/s LPDDR5X bandwidth, and 3.4TB/s core-to-core bandwidth, Vera integrates into NVIDIA's unified AI factory architecture, aiming to lock users into its ecosystem.

NVIDIA Other 2026-07-07

AI Innovators Adopt NVIDIA Vera — Why Max Single-Threaded CPU at Scale Matters

...

Qualcomm Other 2026-06-26

Qualcomm Acquires Modular for $3.9B, Open-Sources Mojo to Break CUDA Lock-In

Qualcomm acquires Modular for $3.9B in stock and open-sources Mojo, a Python-compatible systems language. Mojo targets CUDA dependency, aiming to provide a high-performance alternative for AI developers. This move strengthens Qualcomm's AI inference chip software stack and edge AI competitiveness.

Huawei Other 2026-06-25

Huawei Unveils AI-Centric Network with Token Monetization, UCM Caching Breaks Long-Context Barriers

At MWC Shanghai 2026, Huawei unveiled an AI-native network architecture integrating service, network, and compute, shifting from traffic-centric to intelligence-centric operations. The Unified Cache Manager (UCM) extends KV cache to petabyte-scale external storage, achieving 372% token throughput gains on GLM-5.1 at 128K sequence lengths. Token monetization frameworks and agentic operations enable carriers to charge for AI inference capacity and personalize services.

NVIDIA Other 2026-06-25

Qualcomm Dragonfly: 250-core CPU, HBC memory, UALink interconnects target AI inference TCO

Qualcomm unveils full data center portfolio: Dragonfly C1000 250-core Oryon CPU (>5GHz, PCIe Gen7, CXL), HBC near-memory compute (133TB/s Gen1, 18x-54x effective BW), AI300 inference accelerator (UALink/ESUN scale-up), and 800G/1.6T connectivity. Multi-year Meta CPU deal. Commercial sampling 2027-2028. Targets inference TCO with tokens-per-watt leadership.

ARM Other 2026-06-24

China's LineShine Tops TOP500: CPU-Only 2.2 ExaFLOPS with ARMv9 and HBM Memory

LineShine supercomputer achieves 2.198 ExaFLOPS FP64 sustained using 13.79 million ARMv9 cores across 20,480 nodes, making it the first system to exceed 2 ExaFLOPS without GPUs. Each node has dual LX2 CPUs (304 cores) with 32GB HBM, demonstrating a CPU+HBM architecture breakthrough for HPC.

Microsoft Other 2026-06-23

Microsoft Launches Azure Copilot Observability Agent to Lock Ops Control Plane

Microsoft announces GA of Azure Copilot Observability Agent, built on Azure Monitor. It correlates signals across agents, apps, infrastructure, and services to provide unified operational context. This move aims to lock AI-driven incident diagnosis and remediation workflows deeply within the Azure ecosystem.

NVIDIA Other 2026-06-23

NVIDIA Unveils 45°C Liquid Cooling for Rubin Chips, Slashes Water Use 100%

NVIDIA announces a liquid cooling system for its Rubin GPUs running 45°C coolant (hotter than a hot tub), using dry coolers in a closed loop to cut electricity and eliminate water evaporation (100% reduction). However, chillers may still be needed in hot climates, and chip longevity impacts remain unaddressed.

NVIDIA Other 2026-06-23

NVIDIA Vera Rubin NVL4: CPU-GPU Fusion Locks Supercomputing Architecture

NVIDIA announces the Vera Rubin NVL4 supercomputing platform, integrating the Rubin GPU and Vera CPU via NVLink and InfiniBand for end-to-end acceleration, delivering over 7 exaflops of AI compute. The ARM-based Vera CPU marks a strategic deepening in data center CPUs, with availability expected in Q4 2026.

ARM Other 2026-06-23

Arm Server Share Hits 45%: NVIDIA's Bundling Strategy Reshapes AI Infrastructure

IDC data shows Arm-based servers now hold over 45% of the global server market, driven by NVIDIA's bundling of its Arm-based Vera CPU with GPU systems like NVL72 and Rubin. x86 share shrinks to 52%, while accelerated systems contribute over 70% of revenue. ODM direct sales account for 50.2%, with Dell revenue growing 244.1% YoY.

NVIDIA Other 2026-06-23

NVIDIA Vera Rubin NVL4: Custom ARM CPU and NVLink Converge to Dominate HPC+AI

NVIDIA unveils the Vera Rubin platform, integrating a custom Vera CPU (ARM) and Rubin GPU via NVLink and liquid cooling, delivering >7 exaflops AI and ~5 PF FP64. Targeting HPC+AI convergence at 144 GPUs per rack, it redefines the compute density standard, shipping Q4 2026.

AMD Other 2026-06-23

AMD MI430X GPU Delivers >200 TFLOPS Native FP64, Reshaping HPC-AI Convergence Baseline

AMD powers 4 of top 10 TOP500 supercomputers and previews MI430X GPU with >200 TFLOPS native FP64. This targets AI-for-science workloads, making double-precision compute a key metric for converged HPC-AI infrastructure, directly challenging NVIDIA and Intel.

Amazon Other 2026-06-23

AWS Lambda MicroVMs: Stateful Isolated Sandboxes via Firecracker Snapshots

AWS launches Lambda MicroVMs, leveraging Firecracker for VM-level isolation, near-instant launch/resume, and stateful execution. Users build images from Dockerfiles in S3, launch from pre-initialized snapshots, and suspend/resume automatically, enabling multi-tenant AI code sandboxes and interactive analytics.

ARM Other 2026-06-23

Arm servers capture >45% data center revenue, x86 ecosystem under AI-driven assault

IDC reports Q1 2026 global server revenue hit a record $122.6B, with Arm-based servers capturing >45% share (x86 at 52%). Accelerated servers (GPU/ASIC/FPGA) generated >70% revenue. Nvidia's Grace CPU (NVL72) and hyperscaler custom Arm chips drive the shift; x86 still leads in unit volume but faces supply constraints.

ASML Other 2026-06-23

ASML CEO Validates Musk's Terafab, Reshaping AI Chip Supply Chain

ASML's CEO publicly acknowledges tracking Elon Musk's planned terawatt-scale AI supercomputer Terafab, comparing it to Korean DRAM megaprojects. This signals that the sole EUV lithography supplier is allocating capacity, potentially transforming AI chip supply chain and vertical integration.

Reports

Filter

Cisco Launches Cloud Control and AgenticOps to Consolidate Network Management

TSMC CoWoS Capacity to Reach 200k Wafers by 2027, Diversifying from GPU to CPU and ASIC

Towards Feature Complete Triton Support in JAX-Triton â ROCm Blogs

NVIDIA Rigel Core: Single-Threaded CPU as the New Control Plane for Agentic AI

NVIDIA Vera CPU获Perplexity/OpenAI/Anthropic/Oracle采用 AI Agent性能验证1.5-1.9x加速

NVIDIA Vera CPU: Max Single-Threaded Performance at Scale for Agentic AI

AI Innovators Adopt NVIDIA Vera — Why Max Single-Threaded CPU at Scale Matters

Qualcomm Acquires Modular for $3.9B, Open-Sources Mojo to Break CUDA Lock-In

Huawei Unveils AI-Centric Network with Token Monetization, UCM Caching Breaks Long-Context Barriers

Qualcomm Dragonfly: 250-core CPU, HBC memory, UALink interconnects target AI inference TCO

China's LineShine Tops TOP500: CPU-Only 2.2 ExaFLOPS with ARMv9 and HBM Memory

Microsoft Launches Azure Copilot Observability Agent to Lock Ops Control Plane

NVIDIA Unveils 45°C Liquid Cooling for Rubin Chips, Slashes Water Use 100%

NVIDIA Vera Rubin NVL4: CPU-GPU Fusion Locks Supercomputing Architecture

Arm Server Share Hits 45%: NVIDIA's Bundling Strategy Reshapes AI Infrastructure

NVIDIA Vera Rubin NVL4: Custom ARM CPU and NVLink Converge to Dominate HPC+AI

AMD MI430X GPU Delivers >200 TFLOPS Native FP64, Reshaping HPC-AI Convergence Baseline

AWS Lambda MicroVMs: Stateful Isolated Sandboxes via Firecracker Snapshots

Arm servers capture >45% data center revenue, x86 ecosystem under AI-driven assault

ASML CEO Validates Musk's Terafab, Reshaping AI Chip Supply Chain

Reports

Filter

Cisco Launches Cloud Control and AgenticOps to Consolidate Network Management

TSMC CoWoS Capacity to Reach 200k Wafers by 2027, Diversifying from GPU to CPU and ASIC

Towards Feature Complete Triton Support in JAX-Triton â ROCm Blogs

NVIDIA Rigel Core: Single-Threaded CPU as the New Control Plane for Agentic AI

NVIDIA Vera CPU获Perplexity/OpenAI/Anthropic/Oracle采用 AI Agent性能验证1.5-1.9x加速

NVIDIA Vera CPU: Max Single-Threaded Performance at Scale for Agentic AI

AI Innovators Adopt NVIDIA Vera — Why Max Single-Threaded CPU at Scale Matters

Qualcomm Acquires Modular for $3.9B, Open-Sources Mojo to Break CUDA Lock-In

Huawei Unveils AI-Centric Network with Token Monetization, UCM Caching Breaks Long-Context Barriers

Qualcomm Dragonfly: 250-core CPU, HBC memory, UALink interconnects target AI inference TCO

China's LineShine Tops TOP500: CPU-Only 2.2 ExaFLOPS with ARMv9 and HBM Memory

Microsoft Launches Azure Copilot Observability Agent to Lock Ops Control Plane

NVIDIA Unveils 45°C Liquid Cooling for Rubin Chips, Slashes Water Use 100%

NVIDIA Vera Rubin NVL4: CPU-GPU Fusion Locks Supercomputing Architecture

Arm Server Share Hits 45%: NVIDIA's Bundling Strategy Reshapes AI Infrastructure

NVIDIA Vera Rubin NVL4: Custom ARM CPU and NVLink Converge to Dominate HPC+AI

AMD MI430X GPU Delivers >200 TFLOPS Native FP64, Reshaping HPC-AI Convergence Baseline

AWS Lambda MicroVMs: Stateful Isolated Sandboxes via Firecracker Snapshots

Arm servers capture >45% data center revenue, x86 ecosystem under AI-driven assault

ASML CEO Validates Musk's Terafab, Reshaping AI Chip Supply Chain

Towards Feature Complete Triton Support in JAX-Triton â ROCm Blogs