OpenAI builds custom chip with Broadcom to control its own silicon stack

OpenAI and Broadcom confirm custom silicon deal

OpenAI announced its first custom-designed chip, developed in partnership with Broadcom. The company did not disclose the chip's specifications, timeline to production, or deployment targets (per CNBC reporting). The partnership frames OpenAI's move as part of a broader effort to "build the full stack," a phrase that typically means owning model development, software infrastructure, and now hardware.

This is not OpenAI's first hardware play. The company has invested in infrastructure partnerships before, but a formal chip announcement represents an escalation. Broadcom, a major semiconductor supplier to hyperscalers, manufactures custom silicon for other cloud companies including Meta and Amazon.

The economics of inference are forcing every large lab to own silicon

Custom silicon for frontier models is not optional anymore. Training costs are capped by model scale and flops; inference costs scale directly with usage. As models grow (GPT-4o, o1, future releases), inference margins collapse unless you control the hardware.

A few second-order effects matter: First, this is not about inventing novel chip designs. OpenAI is likely working from reference architectures that Broadcom and other partners have already proven at scale. The play is cost and control, not innovation in semiconductor design. Second, the Broadcom relationship suggests OpenAI is not betting on single-source dependency (Nvidia). Broadcom manufactures at TSMC and works with multiple customers, reducing OpenAI's exposure to supply-chain bottlenecks.

Third, and most important, OpenAI is signaling that chip margins matter to its business model. If the company were confident in its pricing power (selling API access at high margins), custom silicon would be a nice-to-have. The fact that Broadcom is involved suggests OpenAI is building for cost leadership in a market where multiple frontier labs now compete on API pricing and reliability.

What to do this week

If you are buying GPU capacity, lock in multi-year commitments now at fixed prices while they still exist. OpenAI's move will put downward pressure on Nvidia's pricing power within 18 months, but your current contract terms are negotiated in this environment, not the next one. If you are building a model deployment, assume inference costs will drop 20-40% within two years as custom silicon becomes standard. Plan your unit economics accordingly.

For infrastructure and cost leads: Broadcom's involvement in OpenAI's chip means other hyperscalers will follow. Expect similar announcements from Meta and Google within the next 12 months. These chips will not be faster than Nvidia's best offerings; they will be cheaper to operate and more integrated with proprietary models. If your organization has committed to a single inference platform (e.g., only Nvidia), this is the moment to add optionality in your infrastructure roadmap.

OpenAI builds custom chip with Broadcom to control its own silicon stack

Our Take

Why it matters

Do this week

OpenAI and Broadcom confirm custom silicon deal

The economics of inference are forcing every large lab to own silicon

What to do this week

Related stories

Jamendo sues Nvidia over AI training on unlicensed music

China's 360 Says It Built Tools to Match Anthropic's Mythos

Centari Tracks Deal Changes Across Amendments, Not Just Single Documents