compute - AI Infrastructure Intelligence Search

NVIDIA Other 2026-06-17

NVIDIA and Coherent Scale 6-Inch InP Fab, Optical Interconnect Becomes AI Infrastructure's New Bottleneck Breaker

NVIDIA invests $2B and commits multi-billion purchases to Coherent's expanded 6-inch indium phosphide fab in Texas, scaling production of lasers and optical modules for AI interconnects. This addresses copper's distance and power limitations in large GPU clusters (e.g., Vera Rubin Ultra NVL576), pushing co-packaged optics into volume manufacturing.

NVIDIA Other 2026-06-17

NVIDIA ACE Goes Local: Control Shifts from Cloud to RTX GPU for Game AI

NVIDIA launches ACE Game Agent SDK (open-source C/C++ framework) and UE5 plugins (ASR/SLM/TTS), moving AI NPC inference fully on-device via GeForce RTX. DLSS 4.5 plugin adds multi-frame generation. This shifts control from cloud providers to NVIDIA GPU ecosystem, but masks hardware lock-in and local model limitations.

AMD Other 2026-06-17

AMD MLPerf 6.0: MI350 GPUs Achieve 3.5x Leap with MXFP4, Debut Multi-Node Training

AMD submitted its most comprehensive MLPerf Training 6.0 results, including first multi-node training (FLUX.1 on 512 GPUs) and MXFP4 training recipe. MI355X delivers 3.5x generational leap over MI300X on Llama 2-70B, within 5% of NVIDIA B200. 10 ecosystem partners validated reproducibility.

NVIDIA Other 2026-06-17

NVIDIA and HPE Expand AI Factory with Vera CPU for Agentic AI, Full-Stack Integration

NVIDIA and HPE expand the HPE AI Factory with the Vera CPU, the first CPU built for agentic AI, plus the NVIDIA Agent Toolkit, Confidential Computing, and full-stack NVIDIA integration (Spectrum-X, BlueField, ConnectX). This turnkey solution targets enterprise agentic AI production, locking customers into NVIDIA's hardware-software stack.

NVIDIA Other 2026-06-16

NVIDIA Blackwell Sweeps MLPerf: NVLink and NVFP4 Redefine AI Training Economics

NVIDIA Blackwell dominates MLPerf Training 6.0, submitting across all seven benchmarks including MoE workloads. GB300 NVL72 delivers up to 1.6x faster training than GB200, with fifth-gen NVLink unifying 72 GPUs as one giant GPU. NVFP4 low-precision training and massive scale (8,192 GPUs) set new industry standards.

Microsoft Other 2026-06-16

Microsoft Agent 365: Control Plane Lock Replaces Model Lock, Building an Entra Empire for AI

Microsoft launches Agent 365 as a unified control plane for AI agents, integrating Entra, Defender, Purview, Intune, and cost management, alongside the Microsoft IQ semantic platform. While claiming model diversity and openness, this effectively locks enterprise AI assets into Microsoft's management toolchain, shifting control from model layer to infrastructure layer.

NVIDIA Other 2026-06-16

SiMa.ai Palette Neat: Natural-Language Agentic Environment Dismantles NVIDIA's GPU Moat

SiMa.ai launches open-source Palette Neat, an agentic development environment for Physical AI, paired with its sub-10W Modalix SoM. It uses natural language to abstract compute complexity, slashing dev cycles from months to days. Pin-compatible with NVIDIA SoM, it targets breaking the GPU ecosystem lock-in.

Hewlett Packard Enterprise Other 2026-06-16

HPE Nonstop Embeds Agentic AI for Fraud: Control Shifts to Proprietary Inference Engine

HPE integrates Lusis TANGO AIF into Nonstop Compute, embedding Random Forest and deep learning models for real-time, adaptive anti-fraud operations. The solution offers self-healing infrastructure and linear scalability, shifting fraud detection from rule-based engines to AI-driven inference within the proprietary Nonstop environment.

Hewlett Packard Enterprise Other 2026-06-16

HPE Expands Self-Driving Networks: AI Control Plane Unifies Juniper & Aruba, Locks Management Stack

HPE integrates Juniper networking into its AI Data Center Solution, expanding self-driving networks across edge, campus, DC, and AI factories. New Mist support for CX switches, Marvis AIOps in Aruba Central, and QFX switches optimized for inferencing. Unified SASE platform aims to simplify operations via agentic AI automation, consolidating control under a single AI management plane.

AMD Other 2026-06-16

AMD Critical RCE Vulnerability Disclosed After 124 Days, Sparks AI Infrastructure Security Crisis

Security researcher mr.bruh publicly disclosed a critical remote code execution (RCE) vulnerability in AMD processors after 124 days without a fix, with AMD refusing a $10,000 bounty. The flaw affects AI servers running AMD EPYC and Instinct, likened to a Log4j moment for AI infrastructure, forcing enterprises to reassess chip-level security response and supply chain risk.

MediaTek Other 2026-06-16

MediaTek Doubles AI ASIC Target to $2B, Challenges Broadcom in Data Center Custom Silicon

MediaTek doubles its 2026 AI ASIC revenue target to $2B, leveraging Google hyperscaler deals and the NVIDIA RTX Spark chip (featuring MediaTek's N1X Arm CPU). It aims for 10-15% of the $70-80B custom AI chip market by 2027, directly challenging Broadcom's dominance.

AMD Other 2026-06-16

AMD and Rackspace Deploy 30MW Governed AI Stack: Ecosystem Restructuring from Silicon to Outcomes

AMD and Rackspace sign a definitive agreement to deploy 30MW of AMD AI compute (Instinct GPUs including MI355X, EPYC CPUs) across Rackspace's data centers, creating a governed enterprise AI stack with single accountability from silicon to outcomes, targeting regulated industries.

Google Other 2026-06-16

Google Open-Sources Brazos: Plug-and-Play Liquid Cooling for Air-Cooled DCs

Google introduces Brazos, a rack-mounted closed-loop liquid-to-air cooling system for existing air-cooled data centers. Supporting 60kW per rack, it is open-sourced via OCP, enabling high-density AI/HPC deployments without facility retrofits.

AMD Other 2026-06-16

AMD Ryzen 10000 Series to Swap iGPU for NPU: AI Boost at Cost of Basic Display

Leaks suggest AMD's next-gen Zen 6 desktop CPU 'Olympic Ridge' will replace the integrated GPU with an NPU, targeting >40 TOPS for Copilot+ AI PC certification. It also upgrades the client I/O die to support CUDIMM/CAMM and EXPO 1.2 for faster DDR5. The trade-off boosts local AI but forces nearly all users to rely on a discrete GPU for basic display.

AMD Other 2026-06-15

AMD Acquires MEXT: AI-Predicted Flash Nears DRAM Performance to Cut AI Memory TCO

AMD acquires MEXT, an AI-driven memory optimization startup. MEXT's predictive technology makes NAND Flash behave like DRAM, expanding effective memory capacity for AI workloads and lowering TCO. The tech will be integrated across AMD's data center portfolio (EPYC, Instinct) to address memory bottlenecks in large models.

NVIDIA Other 2026-06-15

NVIDIA Bets on World-Action Models: Control Shifts from VLM to Video Backbones

NVIDIA's blog introduces World-Action Models (WAMs) as a paradigm shift from VLM-based VLAs. WAMs leverage pretrained video/world-model backbones to jointly predict future states and robot actions, aiming to bridge the language-to-action grounding gap. This could redefine robot foundation model training but raises concerns about inference cost and latency.

NVIDIA Other 2026-06-15

NVIDIA's Desktop DGX Station with GB300 Shifts Control from Cloud to Local Hardware

ASUS launches ExpertCenter Pro ET900N G3, built on NVIDIA DGX Station GB300 architecture with GB300 Grace Blackwell Ultra chip, 748GB coherent memory, and 20 PFLOPS AI performance. This deskside AI supercomputer enables local LLM fine-tuning, inference, and agentic AI workflows via NVLink-C2C and the full NVIDIA AI software stack including NemoClaw.

MediaTek Other 2026-06-15

Compute Futures Market: Financializing GPU Capacity Could Reshape AI Infrastructure Procurement

Carmen Li is building a GPU pricing index and spot marketplace via Silicon Data and Compute Exchange, aiming to launch compute futures. Backed by DRW, this initiative targets GPU price volatility by standardizing compute trading, potentially creating a trillion-dollar asset class and transforming AI compute procurement.

Cloudflare Other 2026-06-15

Cloudflare Absorbs Ensemble AI: Architectural Model Compression Reshapes Edge Inference Economics

Cloudflare integrates key Ensemble AI talent, bringing NdLinear and NdLinear-LoRA—architectural model compression techniques that preserve multidimensional activations to reduce parameters and compute. This aims to slash inference costs on Workers AI, boost GPU utilization, and accelerate global edge AI deployment.

NVIDIA Other 2026-06-14

NVIDIA & SK hynix Deepen Memory Co-Engineering: Custom HBM for Vera Rubin and Jetson Thor

NVIDIA and SK hynix have announced a multiyear partnership to co-develop next-generation custom memory for NVIDIA's AI factory ecosystem, including Vera Rubin supercomputers, Vera CPUs, RTX Spark PCs, and Jetson Thor robotic platforms. SK hynix will also use NVIDIA CUDA-X libraries and Omniverse to accelerate semiconductor design and build fab digital twins.

Reports

Filter

NVIDIA and Coherent Scale 6-Inch InP Fab, Optical Interconnect Becomes AI Infrastructure's New Bottleneck Breaker

NVIDIA ACE Goes Local: Control Shifts from Cloud to RTX GPU for Game AI

AMD MLPerf 6.0: MI350 GPUs Achieve 3.5x Leap with MXFP4, Debut Multi-Node Training

NVIDIA and HPE Expand AI Factory with Vera CPU for Agentic AI, Full-Stack Integration

NVIDIA Blackwell Sweeps MLPerf: NVLink and NVFP4 Redefine AI Training Economics

Microsoft Agent 365: Control Plane Lock Replaces Model Lock, Building an Entra Empire for AI

SiMa.ai Palette Neat: Natural-Language Agentic Environment Dismantles NVIDIA's GPU Moat

HPE Nonstop Embeds Agentic AI for Fraud: Control Shifts to Proprietary Inference Engine

HPE Expands Self-Driving Networks: AI Control Plane Unifies Juniper & Aruba, Locks Management Stack

AMD Critical RCE Vulnerability Disclosed After 124 Days, Sparks AI Infrastructure Security Crisis

MediaTek Doubles AI ASIC Target to $2B, Challenges Broadcom in Data Center Custom Silicon

AMD and Rackspace Deploy 30MW Governed AI Stack: Ecosystem Restructuring from Silicon to Outcomes

Google Open-Sources Brazos: Plug-and-Play Liquid Cooling for Air-Cooled DCs

AMD Ryzen 10000 Series to Swap iGPU for NPU: AI Boost at Cost of Basic Display

AMD Acquires MEXT: AI-Predicted Flash Nears DRAM Performance to Cut AI Memory TCO

NVIDIA Bets on World-Action Models: Control Shifts from VLM to Video Backbones

NVIDIA's Desktop DGX Station with GB300 Shifts Control from Cloud to Local Hardware

Compute Futures Market: Financializing GPU Capacity Could Reshape AI Infrastructure Procurement

Cloudflare Absorbs Ensemble AI: Architectural Model Compression Reshapes Edge Inference Economics

NVIDIA & SK hynix Deepen Memory Co-Engineering: Custom HBM for Vera Rubin and Jetson Thor