NVLink - AI Infrastructure Intelligence Search

NVIDIA Other 2026-07-31

NVIDIA Vera Rubin Goes Global: 10x Token per Megawatt, Locks In AI Factory Standard

NVIDIA announces full production and global delivery of Vera Rubin platform, with CoreWeave and cloud providers deploying NVL72 systems. Featuring Vera CPU and Rubin GPU, the platform delivers 10x token throughput per megawatt over Blackwell, enabling gigawatt-scale AI factories across 350+ sites.

NVIDIA Other 2026-07-29

NVIDIA Rubin GPU Detailed: 3nm Dual-Die, 336B Transistors, 288GB HBM4, NVLink 6 Doubles Bandwidth

NVIDIA unveiled the full Rubin GPU architecture at SIGGRAPH 2026: 3nm dual-die, 336B transistors, 288GB HBM4 with 22 TB/s bandwidth, and NVLink 6 at 3600 GB/s. The NVL72 rack integrates 72 GPUs with 36 Vera CPUs, requiring full liquid cooling due to >1000W TDP.

AMD Other 2026-07-26

AMD Helios Enters Production: 12-Stack HBM4 Outmuscles NVIDIA, UALoE Opens AI Network

AMD's second-generation Helios rack-scale AI server enters full production, featuring 72 MI455X GPUs with Samsung's exclusive 12-stack HBM4 (31TB per rack). Compared to NVIDIA's 8-stack design, it offers 50% more memory and 30% lower token cost. Microsoft Azure commits to large-scale deployment, solidifying hyperscaler dual-vendor strategy.

NVIDIA Other 2026-07-25

NVIDIA and Wistron Launch US-Based GB300 Production, Shifting AI Manufacturing Landscape

NVIDIA partnered with Wistron to open a $700M factory in Texas, achieving L6 integration of the GB300 Grace Blackwell Ultra system. The first US-made unit contains 1.5M parts, weighs 2 tons, and costs $4M. This milestone accelerates US AI capex localization from 5% to 30%, reshaping the global AI supply chain.

AMD Other 2026-07-24

AMD Helios Rack Challenges NVIDIA NVLink with Open UALoE Interconnect

At Advancing AI 2026, AMD launched the Helios rack with 72 MI455X GPUs, 18 Venice EPYC CPUs, and Pensando networking, claiming 30% higher inference token/$ vs NVIDIA NVL72. It introduced UALoE open interconnect to break NVLink lock-in, partnering with Cerebras, Cisco, and major AI firms.

Microsoft Other 2026-07-24

AMD Helios Full-Stack AI Server Deployed on Azure, UALoE Open Standard Challenges NVLink

AMD and Microsoft Azure announce large-scale deployment of Helios full-stack AI servers, featuring 72 MI455X GPUs, 18 EPYC Venice CPUs, and Pensando DPUs, with 31TB HBM4 and 1.7PB/s memory bandwidth. Two new Azure VM series target Agentic AI and semiconductor design, marking Microsoft's multi-vendor strategy and challenging NVIDIA's dominance.

NVIDIA Other 2026-07-23

NVIDIA-OpenAI $100B Partnership: 10GW Vera Rubin AI Factories Reshape Ecosystem

NVIDIA and OpenAI announce a strategic partnership to deploy at least 10GW of NVIDIA systems using the Vera Rubin platform (Rubin GPU, Vera CPU, HBM4, NVLink 6). NVIDIA will invest up to $100B. First facilities go online in H2 2026, powering OpenAI's next-gen models, marking the era of multi-GW AI factories.

NVIDIA Other 2026-07-23

TEST 2026-07-23 DailyShift 24h信号测试

...

NVIDIA Other 2026-07-22

NVIDIA公开Rubin GPU架构细节：3360亿晶体管，智能体AI性能较Blackwell提升10倍

...

NVIDIA Other 2026-07-22

NVIDIA Reveals Vera Rubin GPU and Vera CPU: 3360B Transistors, 88-Core Olympus, 10x Agentic AI Efficiency

NVIDIA fully discloses Vera Rubin GPU and Vera CPU specifications. The GPU features 3360B transistors, HBM4 288GB, and 10x agentic AI efficiency over Blackwell. The CPU has 88 custom Olympus cores, delivering 2.2x faster agentic AI performance than Intel Sapphire Rapids. This solidifies NVIDIA's full-stack strategy against x86 incumbents.

NVIDIA Other 2026-07-21

NVIDIA Spectrum-6 102.4Tbps Switch Goes Commercial, Cisco Adoption Confirms Bandwidth Inflection

NVIDIA announces Spectrum-6 102.4Tbps Ethernet switch for AI factories, doubling bandwidth with CPO and liquid cooling. Cisco confirms adoption in N9100 series, while Broadcom launches Tomahawk 6, signaling a terabit Ethernet race for AI infrastructure.

NVIDIA Other 2026-07-20

NVIDIA Agent Toolkit Shifts AI Agent Control from Cloud to Local DGX Station

NVIDIA launches Agent Toolkit for DGX Station, comprising NemoClaw, Nemotron 3 Ultra, Omniverse Libraries, and OpenShell. It enables local AI agent deployment in 30 minutes, locking developers into NVIDIA's hardware-software stack and shifting control from cloud services to on-premises hardware.

NVIDIA Other 2026-07-19

NVIDIA Vera Rubin Platform and Dynamo 1.0 Disaggregate Inference, Shift Focus to Intelligence per Dollar

NVIDIA unveils Vera Rubin platform with a 7-chip stack (Vera CPU, Rubin GPU, NVLink 6, etc.) and Dynamo 1.0 inference disaggregation. A single NVL72 rack packs 72 GPUs/36 CPUs with 1.6 PB/s bandwidth, achieving up to 7x inference performance. The new 'intelligence per dollar' metric signals a shift from training to inference cost competition.

Huawei Other 2026-07-17

Huawei Atlas 950 SuperPoD & 灵衢2.0: A Systemic Pivot in China's AI Compute from Chip to Cluster

At WAIC 2026, Huawei publicly demonstrated the Atlas 950 SuperPoD, a 1024-ascend NPU card cluster, and unveiled the 灵衢2.0 high-speed interconnect protocol. This signals a strategic shift in China's AI infrastructure from single-chip to system-level leadership, creating a closed-loop ecosystem that directly challenges NVIDIA's NVL series dominance.

NVIDIA Other 2026-07-06

NVIDIA Kyber NVL144 Delayed to 2028: Midplane PCB Manufacturing Becomes AI Scaling Bottleneck

SemiAnalysis reveals NVIDIA's Kyber NVL144 delayed beyond 12 months to 2028 due to 78-layer Orthogonal Backplane manufacturing challenges. The interim NVL72x2 solution is cancelled due to operational burdens, and the 4-die Rubin Ultra is also scrapped, leaving a product gap in NVIDIA's scaling roadmap.

Anthropic Other 2026-06-30

Anthropic Claude Goes Exclusive on Azure, Microsoft Locks AI Model Distribution via GB300

Anthropic's Claude models are now generally available on Azure Foundry, powered by NVIDIA GB300 NVL72 clusters with over 4600 Blackwell Ultra GPUs. Initial models include Opus 4.8 and Haiku 4.5 with prompt caching and extended thinking. Microsoft gains exclusive enterprise distribution, strengthening its competitive position against AWS and Google Cloud.

NVIDIA Other 2026-06-23

NVIDIA Vera Rubin NVL4: CPU-GPU Fusion Locks Supercomputing Architecture

NVIDIA announces the Vera Rubin NVL4 supercomputing platform, integrating the Rubin GPU and Vera CPU via NVLink and InfiniBand for end-to-end acceleration, delivering over 7 exaflops of AI compute. The ARM-based Vera CPU marks a strategic deepening in data center CPUs, with availability expected in Q4 2026.

NVIDIA Other 2026-06-23

NVIDIA Vera Rubin NVL4: Custom ARM CPU and NVLink Converge to Dominate HPC+AI

NVIDIA unveils the Vera Rubin platform, integrating a custom Vera CPU (ARM) and Rubin GPU via NVLink and liquid cooling, delivering >7 exaflops AI and ~5 PF FP64. Targeting HPC+AI convergence at 144 GPUs per rack, it redefines the compute density standard, shipping Q4 2026.

Google Cloud Other 2026-06-17

ASUS Launches NVIDIA GB300 Deskside AI Supercomputer, Shifting Control from Cloud to On-Prem

ASUS launches the ExpertCenter Pro ET900N G3, powered by NVIDIA's GB300 Grace Blackwell Ultra Desktop Superchip, delivering 20 PFLOPS and 748GB of coherent memory for near-trillion parameter models. Concurrently, Coherent expands InP fab in Texas for optical interconnects, and NVIDIA plans a $20-25B debt offering, signaling a systemic shift of AI control from cloud to localized enterprise hardware.

NVIDIA Other 2026-06-17

NVIDIA & Coherent Expand 6-Inch InP Fab, Locking AI Optical Interconnect Supply Chain

Coherent breaks ground on the world's first 6-inch indium phosphide fab in Texas, backed by $2B from NVIDIA and multi-billion purchase commitments. The facility produces lasers, transceivers, and pluggable optics for silicon photonics interconnects, enabling NVIDIA's Vera Rubin Ultra NVL576 576-GPU clusters and signaling a mass shift from copper to optical backbones in AI data centers.

Reports

Filter