OpenAI 1970-01-01
Product Launch Impact: Major Conf: 85%

OpenAI and Broadcom Launch Jalapeño ASIC for LLM Inference, 9-Month Tapeout

Summary

OpenAI and Broadcom unveil Jalapeño, a custom ASIC for LLM inference, achieving tapeout in 9 months. The chip reduces data movement and claims superior performance per watt. Deployment planned by end of 2026, marking OpenAI's shift to integrated hardware-software infrastructure.

Key Takeaways

OpenAI and Broadcom announced Jalapeño, a custom ASIC designed specifically for LLM inference, achieving tapeout in 9 months. OpenAI used its own AI models to accelerate the design process. OpenAI handled architecture, Broadcom silicon implementation (including Tomahawk networking), and Celestica board/system integration. Engineering samples are running GPT-5.3, Codex, and Spark at target frequencies and power. Early tests show performance per watt exceeding current state-of-the-art. The architecture reduces data movement and balances compute, memory, and network resources for near-peak utilization. OpenAI calls Jalapeño the first step in a multi-generational compute platform, with initial deployment by end of 2026. Broadcom CEO states this is the start of a roadmap enabling Microsoft and partners to deploy gigawatt-scale data centers from 2026.

Why It Matters

Jalapeño is a strategic move to defend against NVIDIA's inference dominance and encircle rivals like Google and Anthropic. By customizing the ASIC for its models, OpenAI aims to lock in its ecosystem, forcing customers to use OpenAI's inference services. But the chip's rigidity is a hidden liability: it cannot adapt to evolving architectures (e.g., MoE, new attention mechanisms), risking asset depreciation. The 9-month tapeout suggests trade-offs in programmability, potentially causing tail latency issues, and reliance on Broadcom's Tomahawk networking creates another supply chain lock-in.

PRO Decision

【Vendors】NVIDIA should accelerate custom inference ASICs leveraging CUDA ecosystem, highlighting programmability and showing Jalapeño's limitations with model evolution. Launch efficient accelerators (e.g., H200, B200) and open optimization toolchains to weaken OpenAI's lock-in.
【Enterprises】CIOs and architects should conduct zero-trust audits on Jalapeño interoperability with existing GPU clusters and model migration costs. Prefer open standards (e.g., OpenAI API with swappable backends) or PCIe-based general accelerators to avoid Broadcom Tomahawk network dependency.
【Investors】Look beyond PR: Jalapeño's success hinges on OpenAI's model stability and Broadcom's capacity. Short-term NVIDIA pressure, but long-term custom ASICs are niche. Watch if OpenAI opens the chip to third parties; otherwise, it's a vertical integration story. Broadcom benefits but faces single-client risk.

Source: 第一财经
View Original →

Get 3-5 key AI infrastructure signals weekly →

💬 Comments (0)