Reports
AI-generated structured vendor updates
Cisco Validates Rapid Fine-tuning on Private AI Infrastructure with NVIDIA
Cisco IT partnered with NVIDIA to achieve 2-5 hour end-to-end embedding model fine-tuning using Nemotron RAG recipe on a single H200 GPU. The solution uses 120B parameter local LLM for synthetic data generation without manual labeling, improving NDCG@1 by 7.3 absolute points. Validates rapid domain-specific retrieval optimization on private AI infrastructure.
ARM Launches AGI CPU Silicon for AI Infrastructure Market
ARM introduced its first production AGI CPU silicon in March 2026, marking a strategic shift from IP licensing to full silicon solutions provider. Designed for next-gen AI infrastructure, this move may reshape the data center processor ecosystem.
HPE Enhances AI Security Architecture for Adoption Risks
HPE introduces SRX400 Series Firewalls, expanded hybrid mesh security, and AI governance capabilities to secure AI adoption. Features include AI app visibility, prompt-level inspection, and identity-based protection to mitigate data exposure risks.
NVIDIA Donates GPU Dynamic Resource Allocation Driver to Kubernetes Community
NVIDIA donated its GPU Dynamic Resource Allocation (DRA) driver to the CNCF, making it an upstream Kubernetes project. This move aims to shift the core control point of GPU orchestration from proprietary vendor layers to the open-source community, and drive standardization in collaboration with major cloud providers.
NVIDIA Donates GPU Dynamic Resource Allocation Driver to Kubernetes
NVIDIA donated its GPU dynamic resource allocation driver to CNCF, supporting MPS and MIG technologies for intelligent GPU sharing and dynamic reconfiguration. Also added GPU support to Kata Containers for AI workload isolation, with KAI Scheduler joining CNCF sandbox.
ARM and NVIDIA Drive Localization Revolution in AI Workstations
ARM and NVIDIA jointly launch DGX Spark AI workstations based on GB10 Grace Blackwell chips, with eight major OEMs releasing products simultaneously. The solution features unified memory architecture supporting 200B parameter models locally, with third-party tests showing 41% faster rendering and 3.2x AI processing speed versus x86 alternatives, enabling seamless cloud-to-edge toolchain migration.
Check Point AI Factory Blueprint: Security Control Shifts to NVIDIA DPU and LLM Layer
Check Point unveils AI Factory Security Blueprint, tightly integrating its firewall with NVIDIA BlueField DPU via DOCA. The architecture enforces security at four layers: LLM, AI infrastructure, perimeter, and workload. The new AI Factory Firewall delivers hardware-accelerated threat prevention without consuming CPU/GPU cycles, aiming to embed security into the AI fabric.
Check Point Releases AI Factory Security Blueprint Covering GPU to LLM Protection
Check Point introduces an AI Factory security architecture blueprint, establishing full-stack protection from GPU hardware layer to LLM prompt layer through a zero-trust framework.
Check Point Releases AI Factory Security Blueprint with Layered Protection Architecture
Check Point released an AI Factory Security Blueprint defining an end-to-end security framework from GPU infrastructure to model governance. The architecture embeds security measures throughout the AI development and operations lifecycle, addressing risks like data poisoning and model theft.
NVIDIA Blackwell Architecture Achieves 25x Energy Efficiency Gain
NVIDIA's Blackwell GPU architecture delivers 25x energy efficiency improvement over Hopper through Transformer Engine and NVLink innovations. This architectural breakthrough significantly reduces AI training/inference operational costs, directly impacting data center TCO and sustainability metrics.
NVIDIA CEO Outlines Accelerated Computing Paradigm, Signaling AI Infrastructure Evolution
In an interview, NVIDIA CEO Jensen Huang systematically elaborated on accelerated computing as a fundamental shift in computer architecture. He emphasized the data center's transition from general-purpose CPUs to specialized acceleration platforms led by GPUs, and believes the future computing stack will be re-architected around accelerated computing.
NVIDIA Outlines Three-Stage Accelerated Computing Evolution and Software-Defined Data Center Strategy
NVIDIA CEO outlined a three-stage accelerated computing evolution, progressing from single GPU acceleration to full-stack acceleration, and now entering the software-defined, AI-driven data center phase. The company emphasizes dynamic resource allocation through software-defined infrastructure and reaffirms its full-stack AI strategy from chips to applications.
Cisco and NVIDIA Embed Firewall in DPU for AI Server Security
Cisco extends its Hybrid Mesh Firewall to NVIDIA BlueField DPU, enabling 400G line-rate stateful segmentation security. The solution deploys security capabilities inside AI servers with hardware acceleration to avoid CPU/GPU resource consumption. Designed for AI front-end networks, it supports multi-tenant isolation and automated policy generation.
AMD Defines Agent Computer Vision for Edge AI Architecture
AMD releases 2026 AI PC roadmap, proposing Agent Computer concept with expanded Ryzen AI stack featuring NPU-GPU-CPU heterogeneous architecture. Enables local multimodal AI agents, shifting PC from productivity tool to proactive AI partner.
AMD Highlights CPU's Critical Role in Agentic AI Orchestration and Inference
AMD states Agentic AI workloads require serial decision-making and context management, better suited for CPUs. The company emphasizes high-core-count, high-memory-bandwidth server CPUs will lead in agent orchestration and lightweight inference, complementing GPUs in training. This signals a strategic repositioning of CPUs in AI data center architecture.
AWS and Cerebras Introduce Decoupled Inference Architecture for AI Performance
AWS collaborates with Cerebras on a heterogeneous inference solution using Trainium and CS-3, featuring a decoupled architecture for compute and memory stages connected via EFA. It targets interactive AI applications with claimed 10x performance gain, deployed on Nitro-secured infrastructure.
Cisco UCS Integrates NVIDIA Blackwell GPU with Dynamic Resource Pooling
Cisco integrates NVIDIA RTX PRO 4500 Blackwell GPU into UCS platform, supporting deployment from data center to edge. Intersight management enables dynamic GPU resource pooling with real-time PCIe allocation. Validated design blueprints accelerate scalable AI inference and vision AI workloads.
AMD and NAVER Cloud Collaborate on Sovereign AI Infrastructure in Korea
AMD and NAVER Cloud announced a strategic collaboration to accelerate sovereign AI infrastructure in Korea. NAVER Cloud will expand deployment of AMD EPYC "Venice" CPUs and gain early access to next-gen Instinct MI455X GPUs, with joint optimization of AI services and software stacks on AMD platforms.
AMD and Samsung Deepen Collaboration, Locking HBM4 Supply and Exploring Foundry Partnership
AMD and Samsung signed an MOU, designating Samsung as the primary HBM4 supplier for the next-gen Instinct MI455X GPU and collaborating on DDR5 memory optimized for 6th Gen EPYC CPUs. The companies will also explore opportunities for Samsung to provide foundry services for future AMD products.
NVIDIA CloudXR Integrates Apple Vision Pro for Enterprise XR Streaming
NVIDIA's CloudXR platform now supports Apple Vision Pro, enabling high-fidelity XR content streaming from cloud or local workstations with RTX GPUs. This addresses mobile headset compute limitations for enterprise applications like industrial design and digital twins.