AMD - AI Infrastructure Intelligence Search

AMD Other 2026-05-20

AMD Ryzen AI Halo & Max PRO 400: Local 300B Parameter Inference, but Hidden Lock-in and Thermal Limits

AMD launches Ryzen AI Halo developer platform (128GB unified memory, 200B parameter models) and Ryzen AI Max PRO 400 series (first x86 client to run 300B parameter models locally). Unified memory, ROCm optimization, and OEM partnerships aim to shift agentic AI from cloud to local, but shared memory bandwidth and thermal constraints limit real-world throughput.

Intel Other 2026-05-16

AI Agent Workloads Trigger Structural CPU Shortage, Arm and AMD Reshape Server Value Chain

AI inference and agent orchestration surge CPU demand, shifting CPU-GPU ratio from 1:8 to 1:1. AMD EPYC lead time 8-12 weeks, Intel Xeon up to 6 months; Arm's 3nm 136-core AGI processor co-developed with Meta/Cerebras/Cloudflare/OpenAI sees demand exceeding 200 billion USD. CPU replaces GPU as the new AI infrastructure bottleneck, with Arm and AMD reshaping the value chain.

Cisco Other 2026-05-14

Cisco Uses MRC to Push SRv6: A Stealth Power Grab in AI Networking

Cisco claims MRC protocol is built on its SRv6 architecture, highlighting application-driven networking, static routing reliability, and deterministic visibility. This is a strategic move to lock AI networking into Cisco's SRv6 ecosystem, countering NVIDIA's Spectrum-X and Arista's alternatives.

Microsoft Other 2026-05-14

Microsoft's DQI at WinHEC 2026: Shifting Driver Control from IHVs to Microsoft

At WinHEC 2026, Microsoft announced the Driver Quality Initiative (DQI), centered on transitioning third-party kernel-mode drivers to user-mode or Microsoft-authored class drivers, alongside enhanced trust verification, lifecycle management, and quality metrics. This aims to systematically improve Windows driver quality but effectively consolidates Microsoft's control over the driver ecosystem.

Cisco Other 2026-05-07

Cisco-AMD Benchmark Shifts AI Fabric Control from GPU to SmartNIC and Switch

Cisco and AMD jointly release benchmarks for AI scale-out fabrics using N9000 800G switches, Pensando Pollara 400 smartNICs, and MI300X GPUs. IBPerf and MLPerf tests show P01/P99 bandwidth near 400Gbps line rate under incast congestion, proving deterministic performance that eliminates GPU stalls.

AMD Other Medium Signal 2026-05-07

AMD Backs SPEC CPU 2026 Benchmark, Emphasizing Open, Trusted Performance Measurement

AMD published a blog endorsing the upcoming SPEC CPU 2026 industry benchmark, emphasizing the critical role of open, reproducible CPU performance standards for customer infrastructure decisions in the AI era. The new benchmark updates its application suite and strengthens support for bare-metal cloud environments and parallel computing.

AMD Other High Signal 2026-05-06

AMD and OpenAI Contribute MRC Protocol to OCP for Scalable AI Networking

AMD, in collaboration with OpenAI, Microsoft, and others, contributed the MRC (Multipath Reliable Connection) protocol, designed for large-scale AI training, to the Open Compute Project (OCP). AMD co-authored the specification and has already deployed MRC on its programmable Pensando DPU/NIC products, positioning its networking technology as a key enabler for resilient and adaptive AI infrastructure.

NVIDIA Other High Signal 2026-05-06

NVIDIA Opens MRC Protocol via OCP, Pushing Standardization of AI Ethernet Fabrics

NVIDIA announced the opening of its MRC (Multipath Reliable Connection) RDMA transport protocol via the Open Compute Project (OCP). The protocol, proven on Spectrum-X Ethernet hardware, aims to enhance throughput, resilience, and GPU utilization for large-scale AI training clusters through multi-path load balancing and hardware-level failure bypass.

AMD Other High Signal 2026-05-06

AMD and OpenAI Introduce MRC, a Next-Gen Transport Protocol for AI Training

AMD, in collaboration with OpenAI, Microsoft, and other industry leaders, has released the specification for the Multipath Reliable Connection (MRC) protocol. MRC addresses performance bottlenecks of RoCEv2 in hyperscale AI training clusters through intelligent packet spraying, selective retransmission, and network-signaled congestion control, aiming to improve bandwidth utilization and job resilience.

AMD Other Medium Signal 2026-05-04

AMD Showcases Heterogeneous Computing Strategy for Enterprise AI with Dell

At Dell Technologies World, AMD highlighted its heterogeneous computing portfolio, aiming to match the right compute engine to specific enterprise AI workloads, while emphasizing hardware-based security and manageability. This signals a shift in AI infrastructure from generic solutions to fine-tuned, scenario-specific deployments.

AMD Other High Signal 2026-04-30

AMD Proposes New AI Infrastructure Networking Paradigm: From Lossless Fabrics to Intelligent Endpoints

AMD published a blog outlining seven key questions for building large-scale AI infrastructure, arguing that traditional lossless Ethernet or InfiniBand architectures face cost and complexity bottlenecks. It advocates shifting network intelligence and reliability functions from expensive, specialized switches to intelligent NICs, enabling reliable transport over standard (potentially lossy) Ethernet to reduce TCO and simplify operations.

AMD Other High Signal 2026-04-29

AMD and Liquid AI Discuss Efficient AI Architecture from Silicon to Systems

AMD's CTO and Liquid AI's CEO discuss the evolution of AI architecture, emphasizing efficiency as key to extending AI from the cloud to edge and endpoint devices. They argue that co-design from silicon to systems enables low-power, responsive AI inference, supporting always-on agents and multi-model orchestration.

AMD Other High Signal 2026-04-27

AMD Extends Edge AI Architecture to Space, Defining Orbital Computing Paradigm

AMD's CTO proposes applying the core principles of 'performance-per-watt' and 'mission-critical reliability' from terrestrial edge AI to space computing. The company is providing a repeatable platform foundation for in-orbit satellite intelligence and future orbital data centers through heterogeneous computing, open software stacks, and modular system design.

AMD Other High Signal 2026-04-27

AMD Highlights AI PC as Critical Infrastructure for Enterprise Agentic AI in IDC White Paper

AMD released an IDC white paper indicating that over 80% of enterprises are planning, piloting, or deploying AI PCs to support scaled Agentic AI. The report highlights high-performance NPUs and on-device AI processing as critical for enabling real-time, secure workflows, signaling a shift in enterprise AI infrastructure from cloud to endpoint.

Google Other 2026-04-22

Google Cloud Next '26: Agent Gateway Seizes Control Plane, TPU 8i Locks Inference

Google Cloud Next '26 announces 8th-gen TPUs (8t for training, 8i for inference), Agent Platform with Agent Gateway, Agent Identity, Agent-to-Agent Orchestration, Agentic Data Cloud, and Agentic Defense integrating Wiz. The move shifts control from infrastructure to agent orchestration, locking enterprises into a vertically integrated stack.

Google Other High Signal 2026-04-03

Google Launches Gemma 4 Open Models, Targeting Edge Inference and AI Agent Architecture

Google introduces the Gemma 4 open model family, with four sizes from 2B to 31B parameters, emphasizing breakthrough intelligence-per-parameter and native support for agentic workflows, multimodality, and long context. The small models are engineered for edge devices, aiming to bring frontier reasoning to mobile and IoT scenarios.

Google Other Medium Signal 2026-04-03

Google Launches Gemma 4 Open Model Family

Google introduces Gemma 4 open model family with four size variants, optimized for edge and mobile devices. The series supports multimodal processing, long context windows and 140+ languages under Apache 2.0 license.

AMD Other High Signal 2026-04-02

AMD Announces Breakthrough MLPerf Inference 6.0 Results, Showcasing Multinode Scaling and Multimodal Capabilities

AMD's MLPerf Inference 6.0 submission, powered by Instinct MI355X GPUs, surpassed 1 million tokens per second for the first time on models like Llama 2 70B and GPT-OSS-120B. The results highlight efficient multinode scaling, rapid enablement of new workloads (e.g., text-to-video model Wan-2.2-t2v), and reproducible performance across a broad partner ecosystem.

AMD Other Medium Signal 2026-04-02

AMD Achieves Breakthrough MLPerf Inference Results

AMD reports its Instinct MI300X accelerators achieved outstanding performance in MLPerf Inference 6.0 benchmarks, setting new records in natural language processing tasks. This demonstrates AMD's growing technical competitiveness in AI inference infrastructure.

AMD Other 2026-03-31

AMD Highlights Use of Its High-Reliability Compute Products in Space Missions

AMD published a promotional article highlighting that its processors and FPGAs have provided compute support for multiple space missions, including NASA's lunar rovers and landers. The article aims to demonstrate the reliability and durability of its products in extreme environments.

Reports

Filter