N
NVIDIA
2026-06-17
Product Launch Impact: Major Conf: 85%

NVIDIA and HPE Expand AI Factory with Vera CPU for Agentic AI, Full-Stack Integration

Summary

NVIDIA and HPE expand the HPE AI Factory with the Vera CPU, the first CPU built for agentic AI, plus the NVIDIA Agent Toolkit, Confidential Computing, and full-stack NVIDIA integration (Spectrum-X, BlueField, ConnectX). This turnkey solution targets enterprise agentic AI production, locking customers into NVIDIA's hardware-software stack.

Key Takeaways

NVIDIA and HPE expand the HPE AI Factory with key components:

  • NVIDIA Vera CPU: First CPU for agentic AI, optimized for tool calls, orchestration, and real-time data processing, delivering deterministic low-latency performance. Available in HPE ProLiant DL394 Gen12 in 2027.
  • NVIDIA Agent Toolkit: Includes Nemotron open models, OpenShell secure runtime, and NemoClaw blueprints, integrated with HPE Private Cloud AI for monitoring, governance, and safe multi-agent systems. HPE Zerto adds rogue agent detection and rollback.
  • NVIDIA Confidential Computing: Extended across all HPE AI Factory solutions via BlueField DPUs and DOCA, providing hardware-based zero-trust enforcement, threat detection, and network encryption.
  • Full-Stack Integration: All solutions feature RTX PRO 6000 Blackwell, Spectrum-X Ethernet, BlueField-3 DPU, ConnectX-8 SuperNIC. Vera Rubin NVL72 will include BlueField-4 DPU, ConnectX-9 SuperNIC, and Spectrum-6 switch, delivering 1.6x higher AI networking performance. InfiniBand options also available.

Why It Matters

This move is a control grab: NVIDIA shifts the control point from general-purpose CPUs to its own Vera CPU+GPU+DPU+network stack. The lock-in is through:

  • Vera CPU's proprietary instruction set and Agent Toolkit, making agentic AI workloads dependent on NVIDIA toolchains, hindering migration to AMD/Intel.
  • Spectrum-X and BlueField DPU with proprietary protocols (enhanced RoCEv2) and DOCA API, locking network control plane and blocking standard Ethernet alternatives. The claimed 1.6x performance masks incompatibility and high DPU licensing costs.
  • Confidential Computing forces BlueField DPU as root of trust, adding hardware dependency and undisclosed tail latency for real-time agent inference.
  • Hides Vera CPU's weak general-purpose performance, forcing mixed-workload enterprises to split infrastructure, increasing TCO.

PRO Decision

[Vendors] (AMD/Intel/Arista/Broadcom): Immediately launch open-standard agentic AI reference architectures using AMD EPYC CPU + Pensando DPU + standard Ethernet (RoCEv2), and open-source agent runtimes (e.g., Ray Serve) to counter NVIDIA's Agent Toolkit. Highlight Vera CPU's weakness in general workloads through independent benchmarks (SPEC CPU).
[Enterprises] (CIOs/Architects): Demand independent benchmarks for Vera CPU on mixed workloads, especially tail latency and token throughput per dollar. Assess Spectrum-X interoperability with existing standard Ethernet (Arista 7800R3/Broadcom Tomahawk) and long-term licensing costs. Contractually guarantee cross-cloud portability via Kubernetes + standard CNI, avoiding mandatory BlueField DPU and DOCA lock-in.
[Investors]: Watch for vendor concentration risk as NVIDIA tightens ecosystem via Vera CPU and full-stack integration. Monitor AMD's Pensando DPU progress and Intel's Sierra Forest+IPU as open alternatives. Short-term revenue growth likely, but long-term lock-in may invite antitrust scrutiny and customer pushback.

Source: NVIDIA新闻中心
View Original →

Get 3-5 key AI infrastructure signals weekly →

💬 Comments (0)