Reports
AI-generated structured vendor updates
NVIDIA RTX Spark SoC Invades Windows PC: Arm CPU + GPU with 128GB Unified Memory Reshapes AI PC
At HPE Discover 2026, NVIDIA unveiled the RTX Spark SoC for Windows PCs, built on TSMC 3nm with a MediaTek-designed Arm CPU, 70B transistors, and up to 128GB unified memory. This marks NVIDIA's official entry into the PC SoC market, directly challenging Intel, AMD, and Qualcomm in the AI PC segment.
NVIDIA Blackwell Ultra GB300 NVL72: 1.44 EFLOPS FP4, 50x AI Factory Boost
NVIDIA launches the Blackwell Ultra GB300 NVL72 rack system with 72 Blackwell Ultra GPUs and 36 Grace CPUs, delivering 1,440 PFLOPS of sparse FP4 compute, 20TB of HBM3e, and 130TB/s of NVLink bandwidth. NVIDIA claims 50x the AI factory output of Hopper. Available now.
NVIDIA's Triple Play: Vera CPU, N1X Laptop Chip, and $6.5B Silicon Photonics Reshape AI Infra Control
NVIDIA delivers Vera, its first agent-specific CPU (88 Arm v9.2 cores, 1.2TB/s memory bandwidth), teases the consumer N1X laptop chip, and invests $6.5B in silicon photonics. Together these moves shift AI orchestration control from x86 toward NVIDIA's Arm ecosystem. Co-packaged optics (CPO) targets the memory wall, but volume production is expected to remain challenging until after 2028.
NVIDIA Collaborates with Energy Leaders to Position AI Factories as Smart Grid Assets
NVIDIA, in collaboration with Emerald AI, proposes treating large-scale AI data centers (AI factories) as flexible, intelligent grid assets rather than static power loads. This architecture integrates accelerated computing, power networking, and control to enhance grid reliability and optimize energy efficiency. Several major energy companies plan to collaborate on this architecture to support AI workloads and accelerate power connection.
NVIDIA Collaborates with Energy Leaders on AI Factory-Grid Integration Architecture
NVIDIA and Emerald AI introduced a new architecture treating AI factories as intelligent grid assets, combining accelerated computing, real-time energy orchestration and reference designs. The Vera Rubin DSX-based approach enables dynamic grid response and has gained support from multiple energy providers.
NVIDIA Unveils Physical AI Data Factory Blueprint and Frontier Models
NVIDIA launched three physical AI frontier models and an open Physical AI Data Factory reference architecture at GTC 2026, converting computation into synthetic training data via the Cosmos world model and OSMO operators. The Omniverse DSX digital twin blueprint enables validation and real-time AI inference integration with Jetson modules.
NVIDIA and Emerald AI Demonstrate Dynamic Energy Adjustment in AI Factories
NVIDIA partners with Emerald AI to demonstrate grid-responsive energy management on a cluster of 96 Blackwell Ultra GPUs, using the NVIDIA System Management Interface for real-time power telemetry and Emerald AI Conductor to dynamically adjust energy use while maintaining performance for high-priority AI workloads.
NVIDIA Defines Flexible AI Factory as Dispatchable Grid Asset
NVIDIA partners with energy firms to introduce the Flexible AI Factory concept, which uses an AI platform to dynamically align computing loads with grid demand. Through software-defined optimization, this turns AI data centers from pure energy consumers into prosumers capable of supporting the grid.
Check Point Releases AI Factory Security Blueprint Covering GPU to LLM Protection
Check Point introduces an AI Factory security architecture blueprint that establishes full-stack protection, from the GPU hardware layer to the LLM prompt layer, through a zero-trust framework.
Check Point Releases AI Factory Security Blueprint with Layered Protection Architecture
Check Point released an AI Factory Security Blueprint defining an end-to-end security framework from GPU infrastructure to model governance. The architecture embeds security measures throughout the AI development and operations lifecycle, addressing risks like data poisoning and model theft.
HPE Launches AI Grid with NVIDIA to Unify Distributed Inference Clusters
HPE announced the AI Grid at NVIDIA GTC, an end-to-end solution built on NVIDIA's reference architecture to securely connect distributed AI factories and inference clusters into a single intelligent system. It enables service providers to deploy and operate thousands of edge inference sites, meeting the predictable, low-latency infrastructure requirements of AI-native applications.
NVIDIA Moves Dynamo 1.0 Inference OS into Production, Strengthening AI Factory Platform Strategy
NVIDIA moves its Dynamo 1.0 inference OS into production, providing a unified software layer to coordinate AI inference workloads across data centers, cloud, and edge. The system simplifies large-scale AI model deployment through a standardized runtime and scheduler that abstract away infrastructure management.
Cisco Expands Secure AI Factory with NVIDIA to Edge and Security
Cisco expands its Secure AI Factory with NVIDIA to enable AI deployment from data centers to edge sites. The update adds security capabilities such as firewall policy enforcement on DPUs and AI Defense integration, and offers flexible architecture options to accelerate scaling to production.
NVIDIA Releases AI Factory Reference Design and Digital Twin Blueprint
NVIDIA unveiled the Vera Rubin DSX AI factory reference design and the Omniverse DSX digital twin blueprint, built on Spectrum-X Ethernet, Quantum-X800 InfiniBand, and BlueField-3 DPUs. The architecture connects real-world sensors with digital twins for continuous AI model training and optimization, extending AI computing from data centers to physical-world automation.
HPE Deepens AI Factory Partnership with NVIDIA, Unveils Full-Stack Supercomputing Solutions
At GTC 2026, HPE announced enhancements to its NVIDIA AI Computing portfolio, introducing full-stack solutions for large-scale AI factories and supercomputers. The offerings integrate compute, GPUs, networking, liquid cooling, software, and services to improve deployment efficiency and time-to-insight.
HPE Deploys Sovereign AI Factories with NVIDIA at National Labs
HPE announces a collaboration with NVIDIA to deploy liquid-cooled sovereign AI systems at Argonne National Laboratory in the U.S. and HLRS in Germany. This move aims to provide government and research institutions with AI infrastructure that meets data sovereignty and compliance requirements, accelerating the deployment and scaling of their AI initiatives.
NVIDIA Proposes Five-Layer AI Cake Theory Defining Infrastructure Buildout Framework
NVIDIA's CEO presented a five-layer AI development framework at Davos, outlining a full-stack buildout that spans energy infrastructure, compute infrastructure, AI models, AI applications, and industry AI factories. The framework emphasizes coordinated development across the layers, driven by generative AI, and offers an ecosystem perspective for enterprise AI strategy planning.
Palo Alto Networks Advocates Service Provider Shift to Secure AI Factory
Palo Alto Networks proposes that service providers transform into 'secure AI factories' by building integrated platforms for AI development, deployment, governance, and security. The platform emphasizes embedded security layers for proactive protection against model poisoning and data leaks, repositioning security from a cost center to a business enabler.
NVIDIA Collaborates with Eli Lilly to Build AI Pharma Factory
NVIDIA partners with Eli Lilly to establish an AI-powered pharmaceutical factory, using GPUs and AI software to accelerate biomolecular simulation and drug design. This represents AI's evolution from an auxiliary tool to core production infrastructure.
Cisco Partners with NVIDIA and VAST on End-to-End Secure AI Data Platform Architecture
Cisco partnered with NVIDIA and VAST to deliver a deployable AI data platform reference architecture integrating compute infrastructure, data platform, and security layers. The architecture employs Cilium for K8s networking, Tetragon for runtime security, and AI Defense for application protection, enabling full lifecycle security from data to AI applications.