DPU - AI Infrastructure Intelligence Search

NVIDIA Other High Signal 2026-04-30

NVIDIA Releases Enterprise AI Factory Reference Architectures, Standardizing On-Premises AI Infrastructure

NVIDIA has released Enterprise AI Factory Reference Architectures, offering three standardized configurations from RTX PRO to NVL72 for on-premises deployments. This architecture integrates compute, networking, storage, and software, aiming to transform AI infrastructure from experimental setups into predictable, scalable industrial operational platforms.

AMD Other High Signal 2026-04-29

AMD and Liquid AI Discuss Efficient AI Architecture from Silicon to Systems

AMD's CTO and Liquid AI's CEO discuss the evolution of AI architecture, emphasizing efficiency as key to extending AI from the cloud to edge and endpoint devices. They argue that co-design from silicon to systems enables low-power, responsive AI inference, supporting always-on agents and multi-model orchestration.

AMD Other High Signal 2026-04-27

AMD Extends Edge AI Architecture to Space, Defining Orbital Computing Paradigm

AMD's CTO proposes applying the core principles of 'performance-per-watt' and 'mission-critical reliability' from terrestrial edge AI to space computing. The company is providing a repeatable platform foundation for in-orbit satellite intelligence and future orbital data centers through heterogeneous computing, open software stacks, and modular system design.

AMD Other High Signal 2026-04-27

AMD Highlights AI PC as Critical Infrastructure for Enterprise Agentic AI in IDC White Paper

AMD released an IDC white paper indicating that over 80% of enterprises are planning, piloting, or deploying AI PCs to support scaled Agentic AI. The report highlights high-performance NPUs and on-device AI processing as critical for enabling real-time, secure workflows, signaling a shift in enterprise AI infrastructure from cloud to endpoint.

AMD Other High Signal 2026-04-02

AMD Announces Breakthrough MLPerf Inference 6.0 Results, Showcasing Multinode Scaling and Multimodal Capabilities

AMD's MLPerf Inference 6.0 submission, powered by Instinct MI355X GPUs, surpassed 1 million tokens per second for the first time on models like Llama 2 70B and GPT-OSS-120B. The results highlight efficient multinode scaling, rapid enablement of new workloads (e.g., text-to-video model Wan-2.2-t2v), and reproducible performance across a broad partner ecosystem.

Check Point Other 2026-03-23

Check Point AI Factory Blueprint: Security Control Shifts to NVIDIA DPU and LLM Layer

Check Point unveils AI Factory Security Blueprint, tightly integrating its firewall with NVIDIA BlueField DPU via DOCA. The architecture enforces security at four layers: LLM, AI infrastructure, perimeter, and workload. The new AI Factory Firewall delivers hardware-accelerated threat prevention without consuming CPU/GPU cycles, aiming to embed security into the AI fabric.

NVIDIA Other High Signal 2026-03-21

NVIDIA Outlines Three-Stage Accelerated Computing Evolution and Software-Defined Data Center Strategy

NVIDIA CEO outlined a three-stage accelerated computing evolution, progressing from single GPU acceleration to full-stack acceleration, and now entering the software-defined, AI-driven data center phase. The company emphasizes dynamic resource allocation through software-defined infrastructure and reaffirms its full-stack AI strategy from chips to applications.

Cisco Other High Signal 2026-03-20

Cisco and NVIDIA Embed Firewall in DPU for AI Server Security

Cisco extends its Hybrid Mesh Firewall to NVIDIA BlueField DPU, enabling 400G line-rate stateful segmentation security. The solution deploys security capabilities inside AI servers with hardware acceleration to avoid CPU/GPU resource consumption. Designed for AI front-end networks, it supports multi-tenant isolation and automated policy generation.

AMD Other High Signal 2026-03-18

AMD and NAVER Cloud Collaborate on Sovereign AI Infrastructure in Korea

AMD and NAVER Cloud announced a strategic collaboration to accelerate sovereign AI infrastructure in Korea. NAVER Cloud will expand deployment of AMD EPYC "Venice" CPUs and gain early access to next-gen Instinct MI455X GPUs, with joint optimization of AI services and software stacks on AMD platforms.

AMD Other High Signal 2026-03-18

AMD and Samsung Deepen Collaboration, Locking HBM4 Supply and Exploring Foundry Partnership

AMD and Samsung signed an MOU, designating Samsung as the primary HBM4 supplier for the next-gen Instinct MI455X GPU and collaborating on DDR5 memory optimized for 6th Gen EPYC CPUs. The companies will also explore opportunities for Samsung to provide foundry services for future AMD products.

Hewlett Packard Enterprise Other High Signal 2026-03-17

HPE Unveils AI Grid Solution for AI WAN Fabric with NVIDIA

HPE announced a collaboration with NVIDIA to launch the AI Grid Solution, securely scaling edge AI. The solution transforms WAN into an AI WAN fabric, connecting distributed inference sites with AI factories for consistent policy and predictable performance. It enables service providers to evolve from connectivity to AI services.

Cisco Other High Signal 2026-03-17

Cisco Expands Secure AI Factory with NVIDIA to Edge and Security

Cisco expands its Secure AI Factory with NVIDIA to enable AI deployment from data centers to edge sites, adding security capabilities like firewall policy enforcement on DPUs and AI Defense integration, offering flexible architecture options to accelerate production scaling.

Cisco Other High Signal 2026-03-17

Cisco and NVIDIA Extend Secure AI Factory with Network-Security Integration

Cisco and NVIDIA deepen collaboration on Secure AI Factory, extending AI deployment from core to edge. Launch high-performance switches with NVIDIA Spectrum and expand security enforcement to DPU level with AI guardrails integration.

NVIDIA Other High Signal 2026-03-17

NVIDIA Releases AI Factory Reference Design and Digital Twin Blueprint

NVIDIA unveiled Vera Rubin DSX AI factory reference design and Omniverse DSX digital twin blueprint, built on Spectrum-X Ethernet, Quantum-X800 InfiniBand and BlueField-3 DPU. The architecture connects real-world sensors with digital twins for continuous AI model training and optimization, extending AI computing from data centers to physical world automation.

Cisco Other High Signal 2026-02-10

Cisco Launches G300 Chip and Systems for AI Agent-Era Data Center Networking

Cisco introduces 102.4Tbps Silicon One G300 switching chip with liquid-cooled N9000/8000 systems delivering 70% energy efficiency, 1.6T optics support, and Nexus One unified management plane upgrade.

Check Point Other High Signal 2025-10-28

Check Point Deploys AI Firewall Architecture on NVIDIA DPU Platform

Check Point launches AI Factory Firewall leveraging NVIDIA BlueField-3 DPUs for securing AI workloads. The architecture shifts policy enforcement to DPU layer with hardware-accelerated AI traffic inspection while maintaining unified policy management framework.

NVIDIA Other 1970-01-01

NVIDIA Acquires Groq LPU: Inference Architecture Shift from HBM to On-Chip SRAM

NVIDIA signs ~$20B licensing deal with Groq for LPU tech, featuring 230MB on-chip SRAM at 80TB/s bandwidth. This targets Transformer inference decode, replacing HBM bottlenecks with ultra-low latency on-chip storage, potentially reshaping the AI inference chip landscape.