GPU - AI Infrastructure Intelligence Search

Cisco Other Medium Signal 2026-03-25

Cisco Validates Rapid Fine-tuning on Private AI Infrastructure with NVIDIA

Cisco IT partnered with NVIDIA to achieve 2-5 hour end-to-end embedding model fine-tuning using Nemotron RAG recipe on a single H200 GPU. The solution uses 120B parameter local LLM for synthetic data generation without manual labeling, improving NDCG@1 by 7.3 absolute points. Validates rapid domain-specific retrieval optimization on private AI infrastructure.

ARM Other High Signal 2026-03-25

ARM Launches AGI CPU Silicon for AI Infrastructure Market

ARM introduced its first production AGI CPU silicon in March 2026, marking a strategic shift from IP licensing to full silicon solutions provider. Designed for next-gen AI infrastructure, this move may reshape the data center processor ecosystem.

Hewlett Packard Enterprise Other High Signal 2026-03-24

HPE Enhances AI Security Architecture for Adoption Risks

HPE introduces SRX400 Series Firewalls, expanded hybrid mesh security, and AI governance capabilities to secure AI adoption. Features include AI app visibility, prompt-level inspection, and identity-based protection to mitigate data exposure risks.

NVIDIA Other High Signal 2026-03-24

NVIDIA Donates GPU Dynamic Resource Allocation Driver to Kubernetes Community

NVIDIA donated its GPU Dynamic Resource Allocation (DRA) driver to the CNCF, making it an upstream Kubernetes project. This move aims to shift the core control point of GPU orchestration from proprietary vendor layers to the open-source community, and drive standardization in collaboration with major cloud providers.

NVIDIA Other High Signal 2026-03-24

NVIDIA Donates GPU Dynamic Resource Allocation Driver to Kubernetes

NVIDIA donated its GPU dynamic resource allocation driver to CNCF, supporting MPS and MIG technologies for intelligent GPU sharing and dynamic reconfiguration. Also added GPU support to Kata Containers for AI workload isolation, with KAI Scheduler joining CNCF sandbox.

ARM Other High Signal 2026-03-24

ARM and NVIDIA Drive Localization Revolution in AI Workstations

ARM and NVIDIA jointly launch DGX Spark AI workstations based on GB10 Grace Blackwell chips, with eight major OEMs releasing products simultaneously. The solution features unified memory architecture supporting 200B parameter models locally, with third-party tests showing 41% faster rendering and 3.2x AI processing speed versus x86 alternatives, enabling seamless cloud-to-edge toolchain migration.

Check Point Other 2026-03-23

Check Point AI Factory Blueprint: Security Control Shifts to NVIDIA DPU and LLM Layer

Check Point unveils AI Factory Security Blueprint, tightly integrating its firewall with NVIDIA BlueField DPU via DOCA. The architecture enforces security at four layers: LLM, AI infrastructure, perimeter, and workload. The new AI Factory Firewall delivers hardware-accelerated threat prevention without consuming CPU/GPU cycles, aiming to embed security into the AI fabric.

Check Point Other High Signal 2026-03-23

Check Point Releases AI Factory Security Blueprint Covering GPU to LLM Protection

Check Point introduces an AI Factory security architecture blueprint, establishing full-stack protection from GPU hardware layer to LLM prompt layer through a zero-trust framework.

Check Point Other High Signal 2026-03-23

Check Point Releases AI Factory Security Blueprint with Layered Protection Architecture

Check Point released an AI Factory Security Blueprint defining an end-to-end security framework from GPU infrastructure to model governance. The architecture embeds security measures throughout the AI development and operations lifecycle, addressing risks like data poisoning and model theft.

NVIDIA Other High Signal 2026-03-21

NVIDIA Blackwell Architecture Achieves 25x Energy Efficiency Gain

NVIDIA's Blackwell GPU architecture delivers 25x energy efficiency improvement over Hopper through Transformer Engine and NVLink innovations. This architectural breakthrough significantly reduces AI training/inference operational costs, directly impacting data center TCO and sustainability metrics.

NVIDIA Other High Signal 2026-03-21

NVIDIA CEO Outlines Accelerated Computing Paradigm, Signaling AI Infrastructure Evolution

In an interview, NVIDIA CEO Jensen Huang systematically elaborated on accelerated computing as a fundamental shift in computer architecture. He emphasized the data center's transition from general-purpose CPUs to specialized acceleration platforms led by GPUs, and believes the future computing stack will be re-architected around accelerated computing.

NVIDIA Other High Signal 2026-03-21

NVIDIA Outlines Three-Stage Accelerated Computing Evolution and Software-Defined Data Center Strategy

NVIDIA CEO outlined a three-stage accelerated computing evolution, progressing from single GPU acceleration to full-stack acceleration, and now entering the software-defined, AI-driven data center phase. The company emphasizes dynamic resource allocation through software-defined infrastructure and reaffirms its full-stack AI strategy from chips to applications.

Cisco Other High Signal 2026-03-20

Cisco and NVIDIA Embed Firewall in DPU for AI Server Security

Cisco extends its Hybrid Mesh Firewall to NVIDIA BlueField DPU, enabling 400G line-rate stateful segmentation security. The solution deploys security capabilities inside AI servers with hardware acceleration to avoid CPU/GPU resource consumption. Designed for AI front-end networks, it supports multi-tenant isolation and automated policy generation.

AMD Other High Signal 2026-03-19

AMD Defines Agent Computer Vision for Edge AI Architecture

AMD releases 2026 AI PC roadmap, proposing Agent Computer concept with expanded Ryzen AI stack featuring NPU-GPU-CPU heterogeneous architecture. Enables local multimodal AI agents, shifting PC from productivity tool to proactive AI partner.

AMD Other Medium Signal 2026-03-19

AMD Highlights CPU's Critical Role in Agentic AI Orchestration and Inference

AMD states Agentic AI workloads require serial decision-making and context management, better suited for CPUs. The company emphasizes high-core-count, high-memory-bandwidth server CPUs will lead in agent orchestration and lightweight inference, complementing GPUs in training. This signals a strategic repositioning of CPUs in AI data center architecture.

Amazon Other High Signal 2026-03-19

AWS and Cerebras Introduce Decoupled Inference Architecture for AI Performance

AWS collaborates with Cerebras on a heterogeneous inference solution using Trainium and CS-3, featuring a decoupled architecture for compute and memory stages connected via EFA. It targets interactive AI applications with claimed 10x performance gain, deployed on Nitro-secured infrastructure.

Cisco Other Medium Signal 2026-03-18

Cisco UCS Integrates NVIDIA Blackwell GPU with Dynamic Resource Pooling

Cisco integrates NVIDIA RTX PRO 4500 Blackwell GPU into UCS platform, supporting deployment from data center to edge. Intersight management enables dynamic GPU resource pooling with real-time PCIe allocation. Validated design blueprints accelerate scalable AI inference and vision AI workloads.

AMD Other High Signal 2026-03-18

AMD and NAVER Cloud Collaborate on Sovereign AI Infrastructure in Korea

AMD and NAVER Cloud announced a strategic collaboration to accelerate sovereign AI infrastructure in Korea. NAVER Cloud will expand deployment of AMD EPYC "Venice" CPUs and gain early access to next-gen Instinct MI455X GPUs, with joint optimization of AI services and software stacks on AMD platforms.

AMD Other High Signal 2026-03-18

AMD and Samsung Deepen Collaboration, Locking HBM4 Supply and Exploring Foundry Partnership

AMD and Samsung signed an MOU, designating Samsung as the primary HBM4 supplier for the next-gen Instinct MI455X GPU and collaborating on DDR5 memory optimized for 6th Gen EPYC CPUs. The companies will also explore opportunities for Samsung to provide foundry services for future AMD products.

NVIDIA Other Medium Signal 2026-03-18

NVIDIA CloudXR Integrates Apple Vision Pro for Enterprise XR Streaming

NVIDIA's CloudXR platform now supports Apple Vision Pro, enabling high-fidelity XR content streaming from cloud or local workstations with RTX GPUs. This addresses mobile headset compute limitations for enterprise applications like industrial design and digital twins.

Reports

Filter

Cisco Validates Rapid Fine-tuning on Private AI Infrastructure with NVIDIA

ARM Launches AGI CPU Silicon for AI Infrastructure Market

HPE Enhances AI Security Architecture for Adoption Risks

NVIDIA Donates GPU Dynamic Resource Allocation Driver to Kubernetes Community

NVIDIA Donates GPU Dynamic Resource Allocation Driver to Kubernetes

ARM and NVIDIA Drive Localization Revolution in AI Workstations

Check Point AI Factory Blueprint: Security Control Shifts to NVIDIA DPU and LLM Layer

Check Point Releases AI Factory Security Blueprint Covering GPU to LLM Protection

Check Point Releases AI Factory Security Blueprint with Layered Protection Architecture

NVIDIA Blackwell Architecture Achieves 25x Energy Efficiency Gain

NVIDIA CEO Outlines Accelerated Computing Paradigm, Signaling AI Infrastructure Evolution

NVIDIA Outlines Three-Stage Accelerated Computing Evolution and Software-Defined Data Center Strategy

Cisco and NVIDIA Embed Firewall in DPU for AI Server Security

AMD Defines Agent Computer Vision for Edge AI Architecture

AMD Highlights CPU's Critical Role in Agentic AI Orchestration and Inference

AWS and Cerebras Introduce Decoupled Inference Architecture for AI Performance

Cisco UCS Integrates NVIDIA Blackwell GPU with Dynamic Resource Pooling

AMD and NAVER Cloud Collaborate on Sovereign AI Infrastructure in Korea

AMD and Samsung Deepen Collaboration, Locking HBM4 Supply and Exploring Foundry Partnership

NVIDIA CloudXR Integrates Apple Vision Pro for Enterprise XR Streaming