compute - AI Infrastructure Intelligence Search

Meta Other 2026-07-29

Meta and BlackRock Form $14B JV for 1GW AI Data Center, Meta Leases Compute to Anthropic

Meta and BlackRock announce a $14B joint venture for a 1GW AI data center in El Paso. BlackRock holds 80% ownership; Meta retains 20% and is in talks to lease compute to Anthropic for up to $10B, positioning Meta as an AI cloud provider.

AMD Other 2026-07-29

AMD Reports Inference at 60% of AI Workloads, Launches Embedded AI Chip, Secures Meta 6GW Deal

At AMD Advancing AI 2026, CEO Lisa Su reported inference now accounts for 60% of AI workloads. AMD launched Ryzen AI Embedded X100 with CPU/GPU/NPU for physical AI, and announced a 6GW multi-generational Instinct GPU agreement with Meta, plus powering the US Sovereign AI Factory with MI355X, EPYC, and Pensando.

ARM Other 2026-07-29

Arm's AGI CPU Targets x86 Dominance with 3x Performance Per Watt in AI Datacenters

Arm Neoverse dominates TOP500 and Green500, announces first custom AGI CPU for gigawatt-scale AI datacenters as central orchestrator. Claims 3x performance per watt over x86, signaling a control plane shift towards Arm. Also launches Arm Performix analysis tool.

Apple Other 2026-07-28

苹果M7芯片跳过M6代际直指端侧AI推理加速

...

NVIDIA Other 2026-07-28

NVIDIA Invests $5B in SSI, Opens Vera Rubin Platform to Lock In AI Safety Research

NVIDIA makes a major equity investment in Safe Superintelligence (SSI) and provides access to its next-generation Vera Rubin GPU platform. The partnership goes beyond hardware sales, giving NVIDIA rare access to SSI's confidential research, with insights feeding back into NVIDIA's platform roadmap, marking a strategic shift from hardware vendor to deep research partner.

Microsoft Azure Other 2026-07-26

Microsoft Azure Integrates AMD Helios Rack-Scale AI Platform, Breaks NVIDIA GPU Monopoly

Microsoft Azure announces deployment of AMD Helios rack-scale AI platform in H2 2026. Rack integrates 72 MI455X GPUs, 18 Venice CPUs, liquid cooling, delivering 2.9 Exaflops FP4 inference. This signals a major industry shift away from NVIDIA GPU monopoly towards multi-vendor heterogeneous AI infrastructure.

AMD Other 2026-07-26

AMD Launches Helios Rack-Scale AI Platform with MI400 GPUs, Targeting Inference TCO

At Advancing AI 2026, AMD unveiled the Helios rackscale platform integrating 72 MI455X GPUs and 18 EPYC Venice CPUs per rack, delivering 2.9 exaflops FP4 inference and 31TB HBM4 memory. The MI430X offers 288 TFLOPS FP64 for HPC. AMD claims up to 30% more inference tokens per dollar vs. competitors.

Anthropic Other 2026-07-26

Anthropic partners with SpaceX for 220K GPUs, shifting AI compute ecosystem beyond hyperscalers

Anthropic signs a deal with SpaceX for 300MW capacity and 220,000 NVIDIA GPUs at Colossus 1 data center, with exploration of orbital AI compute. This diversifies Anthropic's compute sources beyond hyperscalers and doubles Claude Code usage limits.

Meta Other 2026-07-26

Meta Transforms into AI Compute Landlord with $10B Anthropic Lease Deal

Meta is in early talks with Anthropic for a $10 billion compute lease deal, marking its strategic shift from internal AI infrastructure user to commercial landlord. This move monetizes Meta's massive capex and creates an inter-cloud leasing model, fundamentally reshaping the AI compute supply chain.

AMD Other 2026-07-24

AMD Helios Rack Challenges NVIDIA NVLink with Open UALoE Interconnect

At Advancing AI 2026, AMD launched the Helios rack with 72 MI455X GPUs, 18 Venice EPYC CPUs, and Pensando networking, claiming 30% higher inference token/$ vs NVIDIA NVL72. It introduced UALoE open interconnect to break NVLink lock-in, partnering with Cerebras, Cisco, and major AI firms.

Microsoft Other 2026-07-24

Microsoft launches MAI-Image-2.5-Pro and MAI-Voice-2-Flash, deepening in-house AI ecosystem

Microsoft introduces MAI-Image-2.5-Pro and MAI-Voice-2-Flash, proprietary models now powering Bing, PowerPoint, OneDrive, and Dynamics 365, replacing third-party models. Claims up to 84% GPU cost reduction and 2x faster voice, signaling a strategic shift to in-house AI.

Hewlett Packard Enterprise Other 2026-07-22

HPE扩展Private Cloud AI产品线，集成NVIDIA Vera Rubin NVL72平台

...

NVIDIA Other 2026-07-22

NVIDIA and Wistron Open US Factory for GB300 and Vera Rubin AI Superchips

Wistron opens its first US manufacturing facility in Fort Worth, producing NVIDIA GB300 Grace Blackwell Ultra and Vera Rubin superchips. The $700M plant aims for tens of thousands of boards monthly, marking NVIDIA's strategic shift to domestic AI hardware production.

NVIDIA Other 2026-07-21

NVIDIA Vera Rubin Platform Specs Revealed: 10x Tokens per Watt, Monolithic CPU+GPU Design

NVIDIA unveiled Vera Rubin platform specs with a monolithic design pairing 2 Rubin GPUs with 1 Vera CPU, flagship NVL72 integrating 36 CPUs and 72 GPUs. Claims 10x tokens per watt and 3x memory bandwidth over Grace Blackwell. Vera CPU sold standalone. First customers: Microsoft, OpenAI, Oracle. Mass production H2 2026. Performance claims await independent verification.

NVIDIA Other 2026-07-20

NVIDIA Invests $2B in CoreWeave, Debuts 'Compute Central Bank' Model

NVIDIA invests $2 billion in CoreWeave and launches the AI Compute Partner Program, featuring credit enhancement, revenue sharing, and GPU buyback. This transforms NVIDIA from a hardware vendor into a 'compute central bank', tightening control over the AI cloud leasing ecosystem and squeezing intermediaries.

Meta Other 2026-07-20

Meta Launches Muse Spark 1.1 API at 25% Competitor Price, Ends Open-Source Era

Meta releases Muse Spark 1.1, a multimodal reasoning model with 1M token context window, and launches its first paid API at 25% of competitors' price. This ends the Llama open-source era, signaling a strategic shift to proprietary API monetization and aggressive market share capture.

Hewlett Packard Enterprise Other 2026-07-20

HPE Expands Private Cloud AI with NVIDIA Vera Rubin, Enabling Agent-Native AI Factory

HPE expands its Private Cloud AI line with NVIDIA Vera Rubin NVL72 and HGX Rubin NVL8, introducing Compute XD700, Cray GX240 blade with Vera CPU, and Quantum-X800 InfiniBand. New software includes Agent Toolkit and NemoClaw for agent-native AI, with Alletra Storage MP X10000 in Q4 2026.

ARM Other 2026-07-19

ARM Launches AGI CPU, Achieves 3x Performance Per Watt, Tops Supercomputing Rankings

At ISC 2026, ARM announced the Armv9-based LineShine supercomputer as the first to exceed 2 exaflops, topping the TOP500. It also launched the AGI CPU with 136 Neoverse V3 cores for gigawatt-scale AI datacenters, with Neoverse systems achieving 3x performance per watt over x86 and leading the Green500.

Hewlett Packard Enterprise Other 2026-07-19

AMD and HPE Launch Helios Open AI Infrastructure to Rival NVIDIA Ecosystem

AMD and HPE expand partnership to launch Helios, an open-stack AI infrastructure platform integrating EPYC CPUs, Instinct MI455X GPUs, Pensando networking, and ROCm software. Each rack delivers up to 2.9 exaFLOPS FP4, built on OCP principles with Juniper switches, targeting simplified deployment and energy efficiency.

NVIDIA Other 2026-07-16

NVIDIA Jetson Thor T3000/T2000: Blackwell GPU Crashes Edge AI Cost Barrier

NVIDIA unveils Jetson Thor T3000 and T2000 modules. The T3000 packs a Blackwell GPU and 8-core Neoverse CPU, delivering 865 FP4 TFLOPS at half the power of the T5000. New Jetson Agent Skills automate memory optimization, aiming to scale deployment of humanoid robots and edge AI.

Reports

Filter