Filter

×
Active Filters Clear All
Keyword: AI Infrastructure ×
95 Total Reports
2/5 Page
Hewlett Packard Enterprise Other 2026-06-01

HPE Launches Vera CPU Server for Agentic AI, Reshaping Server Ecosystem

HPE unveils ProLiant DL394 Gen12 with NVIDIA Vera CPU, purpose-built for agentic AI and reinforcement learning. It offers extreme single-core performance and high memory bandwidth, with HPE iLO security and Compute Ops Management. The platform is validated with Redpanda and NYSE for financial workloads.

NVIDIA Other 2026-06-01

NVIDIA BlueField DPU In-Silicon Security Shifts AI Factory Control from Software to Hardware

NVIDIA unveils DOCA security stack (Argus, Vault, Flow) on BlueField-4 DPU, enabling hardware-isolated runtime threat detection via zero-copy memory analysis, zero-trust file access, and 800 Gb/s network enforcement. This shifts security control from host OS to DPU silicon, delivering distributed full-stack protection without compromising AI throughput, but deeply ties to Vera Rubin platform, creating ecosystem lock-in.

NVIDIA Other 2026-06-01

NVIDIA DSX OS: Open Source Software to Seize AI Factory Control Plane

NVIDIA launches DSX OS, an open-source modular software suite for operating AI factories. Components include DSX Exchange, MaxLPS, NICo, NVSentinel, etc., unifying IT/OT, power optimization, and lifecycle management. Claims 40% more GPUs under fixed power, but core relies on NVIDIA proprietary hardware, aiming to lock users into its ecosystem.

Intel Other 2026-06-01

Intel Reclaims AI Control Plane: Xeon 6+ and E835 Target Agentic Orchestration

Intel launches Xeon 6+ (288 E-cores on 18A), E835 200GbE controllers, and Crescent Island GPU. The strategy repositions the CPU as the control plane for agentic AI orchestration and data movement, while using E835 Ethernet to standardize AI data center networking.

Cisco Other 2026-05-28

Cisco Scale-Across: Converged Silicon and Optics for Distributed AI Training

Cisco unveils Scale-Across architecture combining Silicon One P200 routing (51.2Tbps) and coherent pluggables (400G/800G ZR/ZR+) with open line systems, enabling deterministic low-latency, lossless connectivity for distributed AI training across data centers separated by tens of kilometers.

NVIDIA Other 2026-05-27

NVIDIA Vera CPU Benchmark Crushes x86: Memory Bandwidth Hegemony for Agentic AI

Phoronix benchmarks show NVIDIA Vera CPU with 88 custom Olympus cores (Armv9.2), 1.2TB/s LPDDR5X bandwidth, and 450W TDP outperforming Intel/AMD x86 across agentic AI workloads. It achieves 1.5x overall performance vs 128-core x86, 90% STREAM TRIAD efficiency, and 20-second Linux kernel compilation.

Cisco Other 2026-05-20

Cisco G300 Intelligent Packet Flow: Hardware-Accelerated AI Networking Breakthrough

Cisco launches Intelligent Packet Flow on Silicon One G300, transforming the fabric into an intelligent system with hardware-accelerated adaptive routing, collective congestion awareness, and telemetry. In 8K-16K GPU clusters, it reduces CCT by 87% vs ECMP, improves JCT by 82%, and unlocks 28% more GPU efficiency.

Intel Other 2026-05-20

Intel Core Ultra 3 SoC Replaces Discrete GPUs in Edge Robotics, Slashing TCO

Intel Core Ultra Series 3 SoC integrates CPU, GPU, and NPU to power edge robotics, replacing discrete GPUs. Partners like Sensory AI run multi-agent AI (vision, language, motion) locally, cutting TCO and eliminating cloud latency. This shifts the cost-performance curve for service robots.

AMD Other 2026-05-20

AMD Ryzen AI Halo & Max PRO 400: Local 300B Parameter Inference, but Hidden Lock-in and Thermal Limits

AMD launches Ryzen AI Halo developer platform (128GB unified memory, 200B parameter models) and Ryzen AI Max PRO 400 series (first x86 client to run 300B parameter models locally). Unified memory, ROCm optimization, and OEM partnerships aim to shift agentic AI from cloud to local, but shared memory bandwidth and thermal constraints limit real-world throughput.

Cisco Other 2026-05-14

Cisco Unified Edge: Service Providers' New Ecosystem Bundle for Edge AI Services

Cisco launches Unified Edge platform integrating compute, networking, storage, and security, managed via Intersight, targeting service providers to deploy AI inference at thousands of edge sites. Verizon as early adopter plans to bundle edge capabilities into enterprise connectivity offerings.

Cisco Other 2026-05-14

Cisco Uses MRC to Push SRv6: A Stealth Power Grab in AI Networking

Cisco claims MRC protocol is built on its SRv6 architecture, highlighting application-driven networking, static routing reliability, and deterministic visibility. This is a strategic move to lock AI networking into Cisco's SRv6 ecosystem, countering NVIDIA's Spectrum-X and Arista's alternatives.

Cisco Other 2026-05-13

Cisco N9300 Smart Switches Embed Security into AI Data Center Fabric

At ONUG 2026, Cisco unveiled Nexus One architecture and N9300 Smart Switches, embedding L4 segmentation, Hypershield, eBPF-based Live Protect, and DPU-integrated firewall directly into the network fabric. This aims to deliver bottleneck-free security for AI workloads while enabling AI-driven operations via AgenticOps and AI Canvas.

Cisco Other 2026-05-07

Cisco-AMD Benchmark Shifts AI Fabric Control from GPU to SmartNIC and Switch

Cisco and AMD jointly release benchmarks for AI scale-out fabrics using N9000 800G switches, Pensando Pollara 400 smartNICs, and MI300X GPUs. IBPerf and MLPerf tests show P01/P99 bandwidth near 400Gbps line rate under incast congestion, proving deterministic performance that eliminates GPU stalls.

ARM Other High Signal 2026-05-07

Arm Reports Record Results, AGI CPU Emerges as New AI Infrastructure Focal Point

Arm reported record FY2026 results with $4.92B revenue and over 20% growth for three consecutive years. The core highlight is the Arm AGI CPU designed for agentic AI, securing over $2B in customer demand and backing from Meta, AWS, Google, and others.

AMD Other Medium Signal 2026-05-07

AMD Backs SPEC CPU 2026 Benchmark, Emphasizing Open, Trusted Performance Measurement

AMD published a blog endorsing the upcoming SPEC CPU 2026 industry benchmark, emphasizing the critical role of open, reproducible CPU performance standards for customer infrastructure decisions in the AI era. The new benchmark updates its application suite and strengthens support for bare-metal cloud environments and parallel computing.

AMD Other High Signal 2026-05-06

AMD and OpenAI Contribute MRC Protocol to OCP for Scalable AI Networking

AMD, in collaboration with OpenAI, Microsoft, and others, contributed the MRC (Multipath Reliable Connection) protocol, designed for large-scale AI training, to the Open Compute Project (OCP). AMD co-authored the specification and has already deployed MRC on its programmable Pensando DPU/NIC products, positioning its networking technology as a key enabler for resilient and adaptive AI infrastructure.

NVIDIA Other High Signal 2026-05-06

NVIDIA Opens MRC Protocol via OCP, Pushing Standardization of AI Ethernet Fabrics

NVIDIA announced the opening of its MRC (Multipath Reliable Connection) RDMA transport protocol via the Open Compute Project (OCP). The protocol, proven on Spectrum-X Ethernet hardware, aims to enhance throughput, resilience, and GPU utilization for large-scale AI training clusters through multi-path load balancing and hardware-level failure bypass.

AMD Other High Signal 2026-05-06

AMD and OpenAI Introduce MRC, a Next-Gen Transport Protocol for AI Training

AMD, in collaboration with OpenAI, Microsoft, and other industry leaders, has released the specification for the Multipath Reliable Connection (MRC) protocol. MRC addresses performance bottlenecks of RoCEv2 in hyperscale AI training clusters through intelligent packet spraying, selective retransmission, and network-signaled congestion control, aiming to improve bandwidth utilization and job resilience.

Anthropic Other High Signal 2026-05-06

Anthropic Secures Compute Deal with SpaceX, Significantly Boosting Claude Capacity

Anthropic announced a partnership with SpaceX to utilize all compute capacity at the Colossus 1 data center, gaining over 300MW of new capacity. This move aims to directly improve service for Claude Pro and Max subscribers, with immediate increases to Claude Code and API rate limits.

Intel Other Medium Signal 2026-05-06

Intel at Computex 2026 Emphasizes CPU's Critical Role in AI Compute

Intel will outline its vision for the AI-driven computing era at Computex 2026, centering on the resurgence of the CPU as a critical AI engine. It emphasizes CPU-GPU/accelerator synergy to build efficient, scalable AI systems atop the broad x86 ecosystem.