memory - AI Infrastructure Intelligence Search

Hewlett Packard Enterprise Other 2026-06-01

HPE Launches Vera CPU Server for Agentic AI, Reshaping Server Ecosystem

HPE unveils ProLiant DL394 Gen12 with NVIDIA Vera CPU, purpose-built for agentic AI and reinforcement learning. It offers extreme single-core performance and high memory bandwidth, with HPE iLO security and Compute Ops Management. The platform is validated with Redpanda and NYSE for financial workloads.

NVIDIA Other 2026-06-01

NVIDIA BlueField DPU In-Silicon Security Shifts AI Factory Control from Software to Hardware

NVIDIA unveils DOCA security stack (Argus, Vault, Flow) on BlueField-4 DPU, enabling hardware-isolated runtime threat detection via zero-copy memory analysis, zero-trust file access, and 800 Gb/s network enforcement. This shifts security control from host OS to DPU silicon, delivering distributed full-stack protection without compromising AI throughput, but deeply ties to Vera Rubin platform, creating ecosystem lock-in.

NVIDIA Other 2026-06-01

NVIDIA Vera CPU: Custom Olympus Core and LPDDR5X Redefine CPU for Agentic AI Factories

NVIDIA unveils Vera CPU with 88 custom Olympus cores, 1.2TB/s LPDDR5X bandwidth, and SCF fabric, targeting CPU execution bottlenecks in agentic AI and reinforcement learning. Claiming 1.8x performance over x86 and memory power under 30W, it shifts AI factory metrics from cores-per-dollar to tokens-per-dollar.

NVIDIA Other 2026-06-01

NVIDIA DSX OS: Open Source Software to Seize AI Factory Control Plane

NVIDIA launches DSX OS, an open-source modular software suite for operating AI factories. Components include DSX Exchange, MaxLPS, NICo, NVSentinel, etc., unifying IT/OT, power optimization, and lifecycle management. Claims 40% more GPUs under fixed power, but core relies on NVIDIA proprietary hardware, aiming to lock users into its ecosystem.

Intel Other 2026-06-01

Intel Reclaims AI Control Plane: Xeon 6+ and E835 Target Agentic Orchestration

Intel launches Xeon 6+ (288 E-cores on 18A), E835 200GbE controllers, and Crescent Island GPU. The strategy repositions the CPU as the control plane for agentic AI orchestration and data movement, while using E835 Ethernet to standardize AI data center networking.

NVIDIA Other 2026-05-27

NVIDIA Vera CPU Benchmark Crushes x86: Memory Bandwidth Hegemony for Agentic AI

Phoronix benchmarks show NVIDIA Vera CPU with 88 custom Olympus cores (Armv9.2), 1.2TB/s LPDDR5X bandwidth, and 450W TDP outperforming Intel/AMD x86 across agentic AI workloads. It achieves 1.5x overall performance vs 128-core x86, 90% STREAM TRIAD efficiency, and 20-second Linux kernel compilation.

Google Other 2026-05-25

Hardcoded ASP.NET Machine Keys Enable ViewState Deserialization RCE in KnowledgeDeliver LMS

Mandiant reveals that KnowledgeDeliver LMS uses hardcoded ASP.NET machineKeys, enabling unauthenticated RCE (CVE-2026-5426). Attackers craft malicious ViewState payloads, deploy BLUEBEAM in-memory webshell, and infect visitors.

Samsung Electronics Other 2026-05-23

Micron Partners TSMC for Custom HBM4E Logic Dies, Targets 2027 Ramp with 1-gamma DRAM

Micron plans to ramp HBM4E in 2027, transitioning to 1-gamma DRAM and using TSMC for both standard and custom logic dies. This marks a shift from standardized HBM to customized solutions, positioning memory as a strategic asset for AI inference workloads.

Google Other 2026-05-21

Google Antigravity Control Plane Redefines AI Development, Locks Agent Orchestration

At I/O 2026, Google launched Antigravity 2.0 desktop app and CLI/SDK as a unified agent control plane, alongside Gemini 3.5 Flash/Omni models, Managed Agents API, and native Android support in AI Studio. This aims to streamline AI development from prototype to production, but effectively locks developers into Google's ecosystem and cloud services.

AMD Other 2026-05-20

AMD Ryzen AI Halo & Max PRO 400: Local 300B Parameter Inference, but Hidden Lock-in and Thermal Limits

AMD launches Ryzen AI Halo developer platform (128GB unified memory, 200B parameter models) and Ryzen AI Max PRO 400 series (first x86 client to run 300B parameter models locally). Unified memory, ROCm optimization, and OEM partnerships aim to shift agentic AI from cloud to local, but shared memory bandwidth and thermal constraints limit real-world throughput.

Google Other 2026-05-19

Google Cloud I/O '26: A2A Protocol and Managed Agents API Shift Agent Control Plane

At Google I/O '26, Google Cloud unveiled a unified agent development toolkit featuring Antigravity 2.0, Managed Agents API, ADK 2.0, and the A2A protocol. The platform evolves Vertex AI into Gemini Enterprise Agent Platform, offering a four-rung ladder from low-code to code-first. It aims to bridge local prototyping and secure cloud deployment via a shared protocol layer, but effectively centralizes agent lifecycle control onto Google Cloud's managed plane.

Cloudflare Other 2026-05-18

Cloudflare Tests Anthropic Mythos: AI-Driven Exploit Chain Construction and Proof Generation

Cloudflare's Project Glasswing tested Anthropic's Mythos Preview, revealing its ability to automatically chain multiple low-severity bugs into exploitable PoCs with runnable code. They built a multi-stage harness to manage noise and context limits, achieving a significant leap in vulnerability discovery quality.

Cloudflare Other 2026-05-14

Cloudflare's Trio of Patches Breaks ClickHouse Partition Bloat Lock Contention

Cloudflare's billing pipeline slowed after a partitioning change to (namespace, day) in ClickHouse, causing massive lock contention from exploding part counts. Three patches—shared lock, deferred vector copy, and binary search—cut query latency by >50% and decoupled performance from part count.

Cisco Other 2026-05-12

Cisco Replaces Human Annotators with LLM Constitutional Definitions for AI Safety Consistency

Cisco introduces Single-Source Safety Definitions, replacing human annotators with LLMs that re-read 300+ line constitutional documents per classification. This AI-first approach achieves 57x reduction in inter-model disagreement, adds intent/content dual-axis scoring, and becomes the default safety taxonomy for Cisco AI Defense, shifting control from humans to machine-readable specifications.

Cisco Other 2026-05-07

Cisco-AMD Benchmark Shifts AI Fabric Control from GPU to SmartNIC and Switch

Cisco and AMD jointly release benchmarks for AI scale-out fabrics using N9000 800G switches, Pensando Pollara 400 smartNICs, and MI300X GPUs. IBPerf and MLPerf tests show P01/P99 bandwidth near 400Gbps line rate under incast congestion, proving deterministic performance that eliminates GPU stalls.

AMD Other Medium Signal 2026-05-07

AMD Backs SPEC CPU 2026 Benchmark, Emphasizing Open, Trusted Performance Measurement

AMD published a blog endorsing the upcoming SPEC CPU 2026 industry benchmark, emphasizing the critical role of open, reproducible CPU performance standards for customer infrastructure decisions in the AI era. The new benchmark updates its application suite and strengthens support for bare-metal cloud environments and parallel computing.

AMD Other High Signal 2026-05-06

AMD and OpenAI Contribute MRC Protocol to OCP for Scalable AI Networking

AMD, in collaboration with OpenAI, Microsoft, and others, contributed the MRC (Multipath Reliable Connection) protocol, designed for large-scale AI training, to the Open Compute Project (OCP). AMD co-authored the specification and has already deployed MRC on its programmable Pensando DPU/NIC products, positioning its networking technology as a key enabler for resilient and adaptive AI infrastructure.

AMD Other High Signal 2026-05-06

AMD and OpenAI Introduce MRC, a Next-Gen Transport Protocol for AI Training

AMD, in collaboration with OpenAI, Microsoft, and other industry leaders, has released the specification for the Multipath Reliable Connection (MRC) protocol. MRC addresses performance bottlenecks of RoCEv2 in hyperscale AI training clusters through intelligent packet spraying, selective retransmission, and network-signaled congestion control, aiming to improve bandwidth utilization and job resilience.

NVIDIA Other 2026-05-05

NVIDIA Extreme Co-Design: Vera Rubin Platform Targets Agentic Inference TCO Inflection

NVIDIA unveils an extreme co-design stack for agentic systems, featuring Vera Rubin NVL72, NVLink 6, ConnectX-9, BlueField-4, and Spectrum-X. By disaggregating inference, optimizing KV cache management, and deploying low-latency fabrics, it aims to break the throughput-interactivity tradeoff, making high-context token processing economically viable.

AMD Other Medium Signal 2026-05-04

AMD Showcases Heterogeneous Computing Strategy for Enterprise AI with Dell

At Dell Technologies World, AMD highlighted its heterogeneous computing portfolio, aiming to match the right compute engine to specific enterprise AI workloads, while emphasizing hardware-based security and manageability. This signals a shift in AI infrastructure from generic solutions to fine-tuned, scenario-specific deployments.

Reports

Filter

HPE Launches Vera CPU Server for Agentic AI, Reshaping Server Ecosystem

NVIDIA BlueField DPU In-Silicon Security Shifts AI Factory Control from Software to Hardware

NVIDIA Vera CPU: Custom Olympus Core and LPDDR5X Redefine CPU for Agentic AI Factories

NVIDIA DSX OS: Open Source Software to Seize AI Factory Control Plane

Intel Reclaims AI Control Plane: Xeon 6+ and E835 Target Agentic Orchestration

NVIDIA Vera CPU Benchmark Crushes x86: Memory Bandwidth Hegemony for Agentic AI

Hardcoded ASP.NET Machine Keys Enable ViewState Deserialization RCE in KnowledgeDeliver LMS

Micron Partners TSMC for Custom HBM4E Logic Dies, Targets 2027 Ramp with 1-gamma DRAM

Google Antigravity Control Plane Redefines AI Development, Locks Agent Orchestration

AMD Ryzen AI Halo & Max PRO 400: Local 300B Parameter Inference, but Hidden Lock-in and Thermal Limits

Google Cloud I/O '26: A2A Protocol and Managed Agents API Shift Agent Control Plane

Cloudflare Tests Anthropic Mythos: AI-Driven Exploit Chain Construction and Proof Generation

Cloudflare's Trio of Patches Breaks ClickHouse Partition Bloat Lock Contention

Cisco Replaces Human Annotators with LLM Constitutional Definitions for AI Safety Consistency

Cisco-AMD Benchmark Shifts AI Fabric Control from GPU to SmartNIC and Switch

AMD Backs SPEC CPU 2026 Benchmark, Emphasizing Open, Trusted Performance Measurement

AMD and OpenAI Contribute MRC Protocol to OCP for Scalable AI Networking

AMD and OpenAI Introduce MRC, a Next-Gen Transport Protocol for AI Training

NVIDIA Extreme Co-Design: Vera Rubin Platform Targets Agentic Inference TCO Inflection

AMD Showcases Heterogeneous Computing Strategy for Enterprise AI with Dell