NVIDIA 2026-06-29
Vendor Strategy Impact: Major Conf: 85%

Jia Yangqing exits NVIDIA as DGX Lepton shutdown reveals software layer failure

Summary

Jia Yangqing leaves NVIDIA after DGX Lepton underperforms and open-source commitments are broken. NVIDIA acquired Lepton AI for ~$700M, rebranded as DGX Cloud Lepton, but service ceased mid-2025. The event signals NVIDIA's failed software layer expansion, shifting control back to hyperscalers.

Key Takeaways

According to SemiAnalysis, Jia Yangqing, former Alibaba Cloud VP and Lepton AI founder, has left NVIDIA after only one year since NVIDIA CEO Jensen Huang acquired the 20-person startup for ~$700M. The immediate trigger was the underperformance of DGX Lepton, which effectively ceased operations by mid-2025. A deeper conflict arose over open-source commitments: NVIDIA promised to open-source the Lepton core platform by 2026 but later vetoed the plan. As the creator of Caffe, PyTorch, and ONNX, Jia's decades-long dedication to open-source clashed with NVIDIA's reversal. Combined with the ~$1.2B acquisition of Run:ai, NVIDIA has invested nearly $2B in GPU cloud software stack, aiming to build a software layer atop AWS, Azure, and other hyperscalers. Jia's departure signals that GPU hardware monopoly cannot easily extend to the software layer.

Why It Matters

This is not just a personnel change but a failure of NVIDIA's control plane shift strategy. By acquiring Lepton AI and Run:ai, NVIDIA aimed to build an orchestration layer atop hyperscalers, locking users into DGX Cloud. The shutdown of DGX Lepton and broken open-source promises expose critical weaknesses in operational stability and ecosystem openness.

Second-order insight: NVIDIA concealed the engineering complexity and cultural clash of integrating Lepton's lightweight open-source tools into its proprietary hardware stack. Jia's exit cripples DGX Cloud's software innovation; Run:ai alone cannot sustain the cloud ambition. Enterprises face stranding risk if they rely on DGX Cloud, and the failure reveals NVIDIA's multi-tenant GPU scheduling deficiencies despite its CUDA dominance.

PRO Decision

【Vendors】Competitors like AMD, Intel, and AWS should exploit this signal by promoting open GPU orchestration standards (e.g., OpenStack Zun, Kubernetes GPU Operator), highlighting the instability and closed nature of DGX Cloud. AMD can partner with cloud providers to offer ROCm-based GPU cloud services, attacking NVIDIA's software weakness.

【Enterprises】CIOs and architects must perform zero-trust technical audits: assess dependency on DGX Cloud or Run:ai, and develop multi-cloud GPU scheduling contingency plans. Avoid locking AI workloads into NVIDIA's proprietary cloud; prioritize cloud-native tools like AWS ParallelCluster or Azure CycleCloud to ensure cross-cloud portability.

【Investors】Recognize that NVIDIA's hardware monopoly does not extend to software. The ~$2B acquisitions failed to generate expected software revenues, exposing cultural clashes and operational gaps. Watch for NVIDIA's ability to build software internally. Near-term, hyperscalers (AWS, Azure) gain bargaining power in GPU cloud services.

Source: 快科技
View Original →

Get 3-5 key AI infrastructure signals weekly →

💬 Comments (0)