Reports
AI-generated structured vendor updates
NVIDIA Forms Open Model Alliance to Advance Nemotron Ecosystem
NVIDIA launched the first open frontier model alliance with Mistral AI to co-develop foundation models. Members share data, compute and expertise for post-training support, with Nemotron models exceeding 45M downloads. This move aims to advance open model innovation against closed ecosystems.
Intel Launches 18A Process Commercial PC Platform with Enhanced AI Inference
Intel launches Core Ultra 3 series commercial processors on 18A process, delivering 4x AI performance improvement. Arc Pro B70 GPU optimized for enterprise AI workloads outperforms competitors in context window and multi-user response. vPro platform deep integration with Intune enhances device management.
NVIDIA Demonstrates AI Factories as Flexible Grid Assets for Peak Demand Management
NVIDIA, in collaboration with EPRI, National Grid, and Emerald AI, demonstrated how AI factories powered by Blackwell GPU clusters can dynamically adjust power consumption in response to grid signals. This allows them to act as 'shock absorbers' during peak demand while maintaining performance for high-priority AI workloads.
NVIDIA and Emerald AI Demonstrate Dynamic Energy Adjustment in AI Factories
NVIDIA partners with Emerald AI to demonstrate grid-responsive energy management on a 96 Blackwell Ultra GPU cluster, using NVIDIA System Management Interface for real-time power telemetry and Emerald AI Conductor to dynamically adjust energy use while maintaining high-priority AI workload performance.
Cisco Validates Rapid Fine-tuning on Private AI Infrastructure with NVIDIA
Cisco IT partnered with NVIDIA to achieve 2-5 hour end-to-end embedding model fine-tuning using Nemotron RAG recipe on a single H200 GPU. The solution uses 120B parameter local LLM for synthetic data generation without manual labeling, improving NDCG@1 by 7.3 absolute points. Validates rapid domain-specific retrieval optimization on private AI infrastructure.
Meta Partners with Arm to Develop New AI Data Center CPUs
Meta partners with Arm to co-develop data center CPUs optimized for AI workloads. The first product, the Arm AGI CPU, aims to boost rack performance density for large-scale AI deployments. It will be available through Arm's ecosystem, with board designs to be open-sourced via the Open Compute Project.
ARM Launches AGI CPU for Agentic AI Infrastructure Era
ARM introduces the Arm AGI CPU, its first silicon product, designed for agentic AI infrastructure on Neoverse. Optimized for massively parallel workloads, it supports 272 cores per blade in a 1OU design, delivering 8160 cores per rack and over 2x performance vs. x86 systems.
ARM Launches AGI CPU Silicon for AI Infrastructure Market
ARM introduced its first production AGI CPU silicon in March 2026, marking a strategic shift from IP licensing to full silicon solutions provider. Designed for next-gen AI infrastructure, this move may reshape the data center processor ecosystem.
Arm Neoverse Reshapes Control Layer in AI Infrastructure
ARM introduces Neoverse infrastructure CPU cores optimized for cloud, AI, and HPC workloads, adopted by NVIDIA, AWS, Microsoft, and Google for their AI platforms, delivering performance gains and energy efficiency. This architecture enables high-density AI workload deployment in cloud and edge environments with enhanced multi-tenant security.
NVIDIA Donates GPU Dynamic Resource Allocation Driver to Kubernetes Community
NVIDIA donated its GPU Dynamic Resource Allocation (DRA) driver to the CNCF, making it an upstream Kubernetes project. This move aims to shift the core control point of GPU orchestration from proprietary vendor layers to the open-source community, and drive standardization in collaboration with major cloud providers.
NVIDIA Donates GPU Dynamic Resource Allocation Driver to Kubernetes
NVIDIA donated its GPU dynamic resource allocation driver to CNCF, supporting MPS and MIG technologies for intelligent GPU sharing and dynamic reconfiguration. Also added GPU support to Kata Containers for AI workload isolation, with KAI Scheduler joining CNCF sandbox.
ARM and NVIDIA Drive Localization Revolution in AI Workstations
ARM and NVIDIA jointly launch DGX Spark AI workstations based on GB10 Grace Blackwell chips, with eight major OEMs releasing products simultaneously. The solution features unified memory architecture supporting 200B parameter models locally, with third-party tests showing 41% faster rendering and 3.2x AI processing speed versus x86 alternatives, enabling seamless cloud-to-edge toolchain migration.
NVIDIA Launches OpenShell, Establishing Runtime Sandbox for Secure Autonomous AI Agents
NVIDIA introduces OpenShell, an open-source project designed as a secure-by-design runtime for autonomous AI agents. It employs a "browser tab" model, isolating agent operations from policy enforcement at the system level to prevent policy overrides and data leaks. NVIDIA is collaborating with key security vendors to establish a unified policy layer for enterprise AI agents.
NVIDIA Launches OpenShell Open-Source Runtime for AI Agent Security Isolation
NVIDIA introduces OpenShell open-source runtime providing system-level sandbox isolation for autonomous AI agents, separating application operations from infrastructure policy enforcement. Partners with Cisco, Google Cloud to establish unified runtime policy management. Releases NemoClaw reference stack for simplified deployment.
Cisco Launches AI Agent Security Suite with Proactive Built-in Protection
Cisco introduced AI Defense Explorer Edition for red team testing, Agent Runtime SDK for policy embedding, open-source DefenseClaw framework, and extended zero trust to AI agents. The multi-layered approach shifts from reactive to proactive built-in security for AI agents.
Cisco Launches DefenseClaw Runtime Security Governance Layer for OpenClaw
Cisco launches open-source DefenseClaw providing runtime security governance for OpenClaw AI agents. The solution integrates scanning tools and threat detection capabilities for pre-execution scanning, runtime monitoring, and enforcement controls. It automates security governance to reduce AI agent deployment risks.
Cisco Extends Zero Trust Security to AI Agent Ecosystem
At RSA 2026, Cisco introduced security innovations for AI agents, extending Zero Trust Access with agent discovery in Identity Intelligence, agentic IAM in Duo, and MCP enforcement in Secure Access SSE. It launched AI Defense: Explorer Edition for self-serve testing and DefenseClaw open source framework to automate security deployment.
Cisco Extends Zero Trust to AI Agents and Launches AI Defense Tools
Cisco extends zero trust access to AI agent identity management via Duo IAM and Secure Access SSE for granular control. Launches AI Defense Explorer for self-service red teaming and security validation, and open-sources DefenseClaw framework with NVIDIA sandbox integration. Splunk SOC adds AI agent capabilities for automated operations.
NVIDIA Defines Flexible AI Factory as Dispatchable Grid Asset
NVIDIA partners with energy firms to introduce Flexible AI Factory concept, using AI platform to dynamically align computing loads with grid demand. This transforms AI data centers from energy consumers to prosumers with grid support capabilities through software-defined optimization.
Check Point AI Factory Blueprint: Security Control Shifts to NVIDIA DPU and LLM Layer
Check Point unveils AI Factory Security Blueprint, tightly integrating its firewall with NVIDIA BlueField DPU via DOCA. The architecture enforces security at four layers: LLM, AI infrastructure, perimeter, and workload. The new AI Factory Firewall delivers hardware-accelerated threat prevention without consuming CPU/GPU cycles, aiming to embed security into the AI fabric.