N
NVIDIA
2026-04-22
Architecture Shift Impact: Major Strength: High Conf: 90%

NVIDIA and Google Cloud Deepen Collaboration to Build Cloud Infrastructure for AI Factories and Physical AI

Summary

NVIDIA and Google Cloud have announced an expanded collaboration, introducing new Vera Rubin and Blackwell GPU-powered instances to build "AI factories" scaling to nearly a million GPUs. The integration of Gemini, Nemotron, and other platforms aims to accelerate production deployment of agentic and physical AI, such as robotics and digital twins.

Key Takeaways

The collaboration announced at Google Cloud Next features multiple technical integrations. Key highlights include the new A5X bare-metal instances powered by NVIDIA Vera Rubin NVL72 rack-scale systems, achieving 10x lower inference cost per token and 10x higher token throughput per megawatt through extreme co-design.

Google Gemini models are now in preview on Google Distributed Cloud with NVIDIA Blackwell GPUs, featuring NVIDIA Confidential Computing to protect prompts and fine-tuning data. NVIDIA Nemotron open models and the NeMo framework are deeply integrated with Google's Gemini Enterprise Agent Platform, offering a complete path from model discovery and customization to deployment, including a new managed reinforcement learning API.

Why It Matters

This signals a shift in AI infrastructure from providing compute to offering end-to-end "AI factory" production environments. The deep integration between a cloud giant and the chip leader is establishing full-stack optimization for complex workflows—from training and inference to agents and physical AI—as the core control plane for next-generation enterprise AI deployment....

Sign up to view full strategic analysis

Sign Up Free

PRO Decision

🔒

Decision recommendations are available for Pro users

Upgrade to Pro $29/mo
Source: NVIDIA新闻中心
View Original →