infrastructure - AI Infrastructure Intelligence Search

NVIDIA Other 2026-06-16

NVIDIA Blackwell Sweeps MLPerf: NVLink and NVFP4 Redefine AI Training Economics

NVIDIA Blackwell dominates MLPerf Training 6.0, submitting across all seven benchmarks including MoE workloads. GB300 NVL72 delivers up to 1.6x faster training than GB200, with fifth-gen NVLink unifying 72 GPUs as one giant GPU. NVFP4 low-precision training and massive scale (8,192 GPUs) set new industry standards.

MediaTek Other 2026-06-16

HBM Bottleneck Reshapes AI Infrastructure: Asian Memory Makers Gain Leverage Over Nvidia

SK Hynix, Samsung, and Micron have crossed $1 trillion market cap as HBM becomes the hard limit in AI infrastructure. Asian suppliers now account for 90% of Nvidia's production costs, shifting the bottleneck from GPU compute to stacked memory and advanced packaging.

AMD Other 2026-06-16

AMD and Rackspace Deploy 30MW Governed AI Stack: Ecosystem Restructuring from Silicon to Outcomes

AMD and Rackspace sign a definitive agreement to deploy 30MW of AMD AI compute (Instinct GPUs including MI355X, EPYC CPUs) across Rackspace's data centers, creating a governed enterprise AI stack with single accountability from silicon to outcomes, targeting regulated industries.

Google Cloud Other 2026-06-16

Apple Rebuilds Siri with Google Gemini, Cuts Legacy Hardware Support

Apple rebuilds Siri using Google Gemini-derived capabilities, introducing five new AFM 3 foundation models (including a 20B-parameter multimodal on-device model). The move is paired with the sharpest hardware support cut in watchOS 27, limiting to S9/S10 chips, signaling a strategic shift from vertical integration to hybrid AI partnerships and accelerated hardware refresh cycles.

CrowdStrike Other 2026-06-16

CrowdStrike's Continuous Identity for AI Agents: Real-Time Risk Engine Replaces Static Policies

CrowdStrike launches Continuous Identity for AI Agents, assigning cryptographically verifiable identities via SPIFFE and authorizing every agent action based on owner, caller, and device risk in real time. It eliminates standing privileges, integrates with Falcon AIDR for permission misuse detection, and extends the identity security control plane across human, non-human, and AI identities.

Cisco Other 2026-06-16

Cisco Security Portfolio Moves to AWS Marketplace: Ecosystem Lock-in Accelerates, Multi-Cloud Neutrality Questioned

Cisco announces availability of its full SaaS security portfolio (Duo, Secure Access, Identity Intelligence, Hybrid Mesh Firewall) on AWS Marketplace, with deep integration with Amazon Bedrock and SageMaker for AI security and zero-trust agent management. This move simplifies procurement and accelerates deployment but deepens AWS dependency, potentially sacrificing multi-cloud flexibility.

Cloudflare Other 2026-06-15

Cloudflare Announces Scheduled Maintenance and Global Infrastructure Expansion

...

Cisco Other 2026-06-15

Cisco G300: A Lock-in Play for AI Network Control Plane Dominance

Cisco launches the Silicon One G300 programmable AI networking chip for AI data centers and ML clusters. It extends Cisco's unified routing, switching, and AI acceleration architecture, but fundamentally aims to lock users into a proprietary control plane, countering open ecosystems from Broadcom and Nvidia.

AMD Other 2026-06-15

AMD Acquires MEXT: AI-Predicted Flash Nears DRAM Performance to Cut AI Memory TCO

AMD acquires MEXT, an AI-driven memory optimization startup. MEXT's predictive technology makes NAND Flash behave like DRAM, expanding effective memory capacity for AI workloads and lowering TCO. The tech will be integrated across AMD's data center portfolio (EPYC, Instinct) to address memory bottlenecks in large models.

AMD Other 2026-06-15

AMD Open-Sources AI Software Stack on Vultr, Taking on NVIDIA CUDA Ecosystem

AMD launches a suite of open-source, modular enterprise AI software components on Vultr Marketplace, including AMD Inference Microservices (AIMs), AI Workbench, Resource Manager, and Solution Blueprints. This aims to provide production-grade AI infrastructure without vendor lock-in, directly challenging NVIDIA's CUDA ecosystem.

NVIDIA Other 2026-06-15

NVIDIA's Desktop DGX Station with GB300 Shifts Control from Cloud to Local Hardware

ASUS launches ExpertCenter Pro ET900N G3, built on NVIDIA DGX Station GB300 architecture with GB300 Grace Blackwell Ultra chip, 748GB coherent memory, and 20 PFLOPS AI performance. This deskside AI supercomputer enables local LLM fine-tuning, inference, and agentic AI workflows via NVLink-C2C and the full NVIDIA AI software stack including NemoClaw.

Research Other 2026-06-15

Z.ai GLM-5.2 Ships Usable 1M-Token Context, No Benchmarks, Two Thinking Levels

Z.ai releases GLM-5.2 with a claim of usable 1M-token context and two thinking-effort levels. No standard benchmarks are provided, raising concerns about real-world performance. The model targets replacing chunking-based RAG with native long-context reasoning.

MediaTek Other 2026-06-15

Compute Futures Market: Financializing GPU Capacity Could Reshape AI Infrastructure Procurement

Carmen Li is building a GPU pricing index and spot marketplace via Silicon Data and Compute Exchange, aiming to launch compute futures. Backed by DRW, this initiative targets GPU price volatility by standardizing compute trading, potentially creating a trillion-dollar asset class and transforming AI compute procurement.

Cloudflare Other 2026-06-15

Cloudflare Absorbs Ensemble AI: Architectural Model Compression Reshapes Edge Inference Economics

Cloudflare integrates key Ensemble AI talent, bringing NdLinear and NdLinear-LoRA—architectural model compression techniques that preserve multidimensional activations to reduce parameters and compute. This aims to slash inference costs on Workers AI, boost GPU utilization, and accelerate global edge AI deployment.

NVIDIA Other 2026-06-14

NVIDIA Partners SK Telecom for Gigawatt-Scale AI Cloud, Pushes DSX as Sovereign AI Factory Blueprint

SK Telecom plans to build a gigawatt-scale AI cloud in Korea using NVIDIA's DSX platform, with first AI factory online in 2027. The platform integrates NVIDIA accelerated computing, systems, and software to support sovereign, physical, and agentic AI services, targeting expansion across Asia.

NVIDIA Other 2026-06-14

NVIDIA & SK hynix Deepen Memory Co-Engineering: Custom HBM for Vera Rubin and Jetson Thor

NVIDIA and SK hynix have announced a multiyear partnership to co-develop next-generation custom memory for NVIDIA's AI factory ecosystem, including Vera Rubin supercomputers, Vera CPUs, RTX Spark PCs, and Jetson Thor robotic platforms. SK hynix will also use NVIDIA CUDA-X libraries and Omniverse to accelerate semiconductor design and build fab digital twins.

NVIDIA Other 2026-06-14

NVIDIA Vera CPU: Seizing the AI Agent Control Plane from x86

NVIDIA unveils Vera CPU, purpose-built for AI agents, featuring 88 Olympus cores and 1.2TB/s LPDDR5X memory. Claiming 1.8x faster task completion over x86, it targets agentic AI workloads. Customers include Anthropic, OpenAI, and Oracle Cloud Infrastructure, signaling a shift of the AI control plane to NVIDIA's ecosystem.

NVIDIA Other 2026-06-13

NVIDIA AgentPerf Benchmark: Blackwell Ultra Delivers 20x More Agents per Megawatt vs Hopper

NVIDIA and Artificial Analysis unveil AgentPerf, the first benchmark for agentic AI workloads. Results show the GB300 NVL72 platform delivers up to 20x more concurrent agents per megawatt than the HGX H200 when running DeepSeek V4 Pro, using real coding agent trajectories to measure throughput and responsiveness.

NVIDIA Other 2026-06-11

NVIDIA Halos OS: A Certified Safety OS That Seizes Control of Autonomous Driving

NVIDIA introduces Halos OS, a full-stack safety system comprising ASIL D certified Halos Core, standardized Halos SDK, AI guardrails in Halos Applications, and cloud-based Safety Evaluation Framework. Built on DRIVE Hyperion, it aims to embed safety into L4 robotaxis from the ground up.

Cisco Other 2026-06-11

Cisco Cloud Control: The Control Plane Shift to AI-Native Unified Infrastructure and Observability

Cisco unveils Cisco Cloud Control, a new operating model integrating Splunk for AI-native observability and agentic operations. By unifying network infrastructure, data fabric, and AI trust, it aims to reduce MTTR and costs—but also tightens vendor lock-in on both networking and monitoring.

Reports

Filter

NVIDIA Blackwell Sweeps MLPerf: NVLink and NVFP4 Redefine AI Training Economics

HBM Bottleneck Reshapes AI Infrastructure: Asian Memory Makers Gain Leverage Over Nvidia

AMD and Rackspace Deploy 30MW Governed AI Stack: Ecosystem Restructuring from Silicon to Outcomes

Apple Rebuilds Siri with Google Gemini, Cuts Legacy Hardware Support

CrowdStrike's Continuous Identity for AI Agents: Real-Time Risk Engine Replaces Static Policies

Cisco Security Portfolio Moves to AWS Marketplace: Ecosystem Lock-in Accelerates, Multi-Cloud Neutrality Questioned

Cloudflare Announces Scheduled Maintenance and Global Infrastructure Expansion

Cisco G300: A Lock-in Play for AI Network Control Plane Dominance

AMD Acquires MEXT: AI-Predicted Flash Nears DRAM Performance to Cut AI Memory TCO

AMD Open-Sources AI Software Stack on Vultr, Taking on NVIDIA CUDA Ecosystem

NVIDIA's Desktop DGX Station with GB300 Shifts Control from Cloud to Local Hardware

Z.ai GLM-5.2 Ships Usable 1M-Token Context, No Benchmarks, Two Thinking Levels

Compute Futures Market: Financializing GPU Capacity Could Reshape AI Infrastructure Procurement

Cloudflare Absorbs Ensemble AI: Architectural Model Compression Reshapes Edge Inference Economics

NVIDIA Partners SK Telecom for Gigawatt-Scale AI Cloud, Pushes DSX as Sovereign AI Factory Blueprint

NVIDIA & SK hynix Deepen Memory Co-Engineering: Custom HBM for Vera Rubin and Jetson Thor

NVIDIA Vera CPU: Seizing the AI Agent Control Plane from x86

NVIDIA AgentPerf Benchmark: Blackwell Ultra Delivers 20x More Agents per Megawatt vs Hopper

NVIDIA Halos OS: A Certified Safety OS That Seizes Control of Autonomous Driving

Cisco Cloud Control: The Control Plane Shift to AI-Native Unified Infrastructure and Observability