Filter

×
Active Filters Clear All
Keyword: AI infrastructure ×
106 Total Reports
1/6 Page
AMD Other 2026-06-16

AMD and Rackspace Deploy 30MW Governed AI Stack: Ecosystem Restructuring from Silicon to Outcomes

AMD and Rackspace sign a definitive agreement to deploy 30MW of AMD AI compute (Instinct GPUs including MI355X, EPYC CPUs) across Rackspace's data centers, creating a governed enterprise AI stack with single accountability from silicon to outcomes, targeting regulated industries.

Cloudflare Other 2026-06-15

Cloudflare Announces Scheduled Maintenance and Global Infrastructure Expansion

...

AMD Other 2026-06-15

AMD Acquires MEXT: AI-Predicted Flash Nears DRAM Performance to Cut AI Memory TCO

AMD acquires MEXT, an AI-driven memory optimization startup. MEXT's predictive technology makes NAND Flash behave like DRAM, expanding effective memory capacity for AI workloads and lowering TCO. The tech will be integrated across AMD's data center portfolio (EPYC, Instinct) to address memory bottlenecks in large models.

AMD Other 2026-06-15

AMD Open-Sources AI Software Stack on Vultr, Taking on NVIDIA CUDA Ecosystem

AMD launches a suite of open-source, modular enterprise AI software components on Vultr Marketplace, including AMD Inference Microservices (AIMs), AI Workbench, Resource Manager, and Solution Blueprints. This aims to provide production-grade AI infrastructure without vendor lock-in, directly challenging NVIDIA's CUDA ecosystem.

NVIDIA Other 2026-06-15

NVIDIA's Desktop DGX Station with GB300 Shifts Control from Cloud to Local Hardware

ASUS launches ExpertCenter Pro ET900N G3, built on NVIDIA DGX Station GB300 architecture with GB300 Grace Blackwell Ultra chip, 748GB coherent memory, and 20 PFLOPS AI performance. This deskside AI supercomputer enables local LLM fine-tuning, inference, and agentic AI workflows via NVLink-C2C and the full NVIDIA AI software stack including NemoClaw.

Research Other 2026-06-15

Z.ai GLM-5.2 Ships Usable 1M-Token Context, No Benchmarks, Two Thinking Levels

Z.ai releases GLM-5.2 with a claim of usable 1M-token context and two thinking-effort levels. No standard benchmarks are provided, raising concerns about real-world performance. The model targets replacing chunking-based RAG with native long-context reasoning.

MediaTek Other 2026-06-15

Compute Futures Market: Financializing GPU Capacity Could Reshape AI Infrastructure Procurement

Carmen Li is building a GPU pricing index and spot marketplace via Silicon Data and Compute Exchange, aiming to launch compute futures. Backed by DRW, this initiative targets GPU price volatility by standardizing compute trading, potentially creating a trillion-dollar asset class and transforming AI compute procurement.

NVIDIA Other 2026-06-14

NVIDIA & SK hynix Deepen Memory Co-Engineering: Custom HBM for Vera Rubin and Jetson Thor

NVIDIA and SK hynix have announced a multiyear partnership to co-develop next-generation custom memory for NVIDIA's AI factory ecosystem, including Vera Rubin supercomputers, Vera CPUs, RTX Spark PCs, and Jetson Thor robotic platforms. SK hynix will also use NVIDIA CUDA-X libraries and Omniverse to accelerate semiconductor design and build fab digital twins.

NVIDIA Other 2026-06-13

NVIDIA AgentPerf Benchmark: Blackwell Ultra Delivers 20x More Agents per Megawatt vs Hopper

NVIDIA and Artificial Analysis unveil AgentPerf, the first benchmark for agentic AI workloads. Results show the GB300 NVL72 platform delivers up to 20x more concurrent agents per megawatt than the HGX H200 when running DeepSeek V4 Pro, using real coding agent trajectories to measure throughput and responsiveness.

NVIDIA Other 2026-06-11

NVIDIA Halos OS: A Certified Safety OS That Seizes Control of Autonomous Driving

NVIDIA introduces Halos OS, a full-stack safety system comprising ASIL D certified Halos Core, standardized Halos SDK, AI guardrails in Halos Applications, and cloud-based Safety Evaluation Framework. Built on DRIVE Hyperion, it aims to embed safety into L4 robotaxis from the ground up.

Cisco Other 2026-06-11

Cisco Cloud Control: The Control Plane Shift to AI-Native Unified Infrastructure and Observability

Cisco unveils Cisco Cloud Control, a new operating model integrating Splunk for AI-native observability and agentic operations. By unifying network infrastructure, data fabric, and AI trust, it aims to reduce MTTR and costs—but also tightens vendor lock-in on both networking and monitoring.

NVIDIA Other 2026-06-11

NVIDIA Locks Local AI Inference Control with DiffusionGemma Parallel Generation

NVIDIA optimizes Google DeepMind's DiffusionGemma open model, which generates 256 tokens in parallel for 4x speedup over autoregressive models. Achieves 1000 tokens/sec on H100, 150 tokens/sec on DGX Spark, running fully locally with no cloud cost. This reinforces NVIDIA GPU's centrality in compute-bound local AI inference.

AMD Other 2026-06-11

AMD, Dell, Cambridge Launch UK Sovereign AI Lab to Challenge NVIDIA's CUDA Dominance with Open ROCm

AMD, Dell, and the University of Cambridge launch the Sovereign AI Innovation Lab (SAIL) in the UK, deploying Zenith supercomputer with 5th Gen EPYC and Instinct MI355X GPUs, plus the Sunrise fusion AI system. The lab promotes open, interoperable AI infrastructure based on AMD ROCm, challenging NVIDIA's CUDA lock-in and offering long-term technology choice for national AI initiatives.

NVIDIA Other 2026-06-10

NVIDIA Integrates BESS into AI Factory Power Architecture: Control Plane Shifts to Smart Storage

NVIDIA integrates Battery Energy Storage Systems (BESS) as a system-level component within its DSX platform for AI factories, shifting power infrastructure from passive backup to active control. BESS combines inverters, real-time telemetry, and dynamic control for load smoothing, ride-through, and faster grid interconnection, with self-qualification guidelines setting new validation standards.

ARM Other 2026-06-10

Arm's Neural Dawn: Dedicated Neural Accelerators Redefine Mobile GPU Roadmap

Arm and Sumo Digital unveil Neural Dawn, the first mobile game to use Unreal Engine MegaLights. By integrating dedicated neural accelerators into next-gen Mali GPUs, it delivers desktop-class ray-traced lighting within mobile power limits, signaling a shift from traditional to AI-native graphics pipelines.

NVIDIA Other 2026-06-10

Delivering Lifecycle Control for AI Infrastructure at Scale with NVIDIA DGX Spark Enterprise Manageability

Delivering Lifecycle Control for AI Infrastructure at Scale with NVIDIA DGX Spark Enterprise Manageability2026-06-09T19:00:00+00:00As AI infrastructure scales, enterprise expectations for operational ...

AMD Other 2026-06-10

AMD EPYC Challenges Rack-Scale Density for Agentic AI Control

AMD claims its EPYC processors lead in rack-scale performance for agentic AI's CPU-intensive services (orchestration, caching, databases). Under a 100kW rack model, EPYC 9965 'Turin' delivers 2.37x throughput over NVIDIA Vera, with next-gen 'Venice' projected at 3.30x. Emphasizes deployability on current x86 platforms, avoiding future architecture dependency.

Google Other 2026-06-09

GKE Inference Gateway Prefix Caching: 92% Faster AI Inference with Hidden Lock-in

Google Cloud launches GKE Inference Gateway with prefix caching and model-aware routing, achieving 92.8% lower TTFT and 15.7% higher throughput on Llama 3.1 8B. Snap reports 75-80% cache hit rates. However, deep integration with GKE Gateway API risks lock-in, limiting multi-cloud portability.

NVIDIA Other 2026-06-08

NVIDIA's UK Sovereign AI Play: From Chip Vendor to National Infrastructure Controller

NVIDIA partners with the UK government to deploy sovereign AI infrastructure via Isambard-AI (5,400 GH200 superchips) and the Sovereign AI Fund, backing local startups. This move establishes a national AI control plane, locking compute into NVIDIA's ecosystem and bypassing traditional hyperscalers like AWS and Azure.

NVIDIA Other 2026-06-08

NVIDIA and LG Build AI Factory: DSX Platform Locks Physical AI Stack

NVIDIA and LG Group jointly build an AI factory leveraging NVIDIA's DSX platform, integrating Isaac Sim/Lab, Cosmos, GR00T frameworks for robotics, autonomous driving, data centers, and sovereign AI. LG subsidiaries align cooling, robotics, and sensor components exclusively with NVIDIA, creating a fortified ecosystem.