Skip to main content
Back to News
OpenAI and Broadcom debut custom AI inference chip 'Jalapeno' in nine-month design cycle
Technology
2 min read
US

OpenAI and Broadcom debut custom AI inference chip 'Jalapeno' in nine-month design cycle

The AMW Read

The custom chip is a new product from OpenAI, a known player in the infrastructure segment, representing a meaningful update to the inference hardware landscape (novelty 2) with potential cross-segment impact on model deployment costs and stack control (significance 3).
NoveltySignificance
AI Infra · Player MapSilicon Substrate
OpenAI
OpenAI

Foundation Models / LLMs

View Company Profile

OpenAI and Broadcom debut custom AI inference chip 'Jalapeno' in nine-month design cycle

On June 24, 2026, OpenAI and Broadcom announced the first custom-designed AI inference chip, named 'Jalapeno,' optimized for large language model inference. The chip, described as the first step in a multi-year roadmap, was designed and manufactured in just nine months using Broadcom's silicon expertise and Celestica's manufacturing capabilities. OpenAI stated that the chip is already running production inference workloads, including GPT-5.3-Codex-Spark, though specific performance and power figures were not disclosed.

Why it matters: The 'Jalapeno' chip signals a decisive move by OpenAI to vertically integrate its inference stack, reducing reliance on third-party GPU suppliers and potentially lowering inference costs significantly. This aligns with the 'hyperscaler vertical integration' pattern seen across top-tier AI labs, where control over silicon, model architecture, and runtime is consolidated to improve performance and margin. The nine-month design cycle also marks a notable acceleration in custom chip development, compressing what typically takes 18–24 months, and updates the competitive landscape for AI infrastructure players.

Expert take: This development places OpenAI in direct competition with other AI labs and hyperscalers that have pursued custom silicon, such as Google's TPU and Anthropic's reported chip partnership. The rapid turnaround—from design to production in under a year—suggests a mature design methodology and tight collaboration between OpenAI and Broadcom. If the chip yields the claimed 'dramatic improvement' in inference efficiency, it could pressure GPU incumbents like Nvidia and accelerate the industry's shift toward purpose-built inference hardware. However, scaling production across multiple facilities and proving reliability at high volume remain open questions.

#OpenAI #Broadcom #AIChip #Inference #CustomSilicon #Jalapeno

#OpenAI#Broadcom#Jalapeno#AI inference chip#custom silicon#LLM inference#vertical integration#Celestica

How This Connects

Based on AI Infra · Player Map

  1. 1d agoOpenAI and Broadcom debut custom AI inference chip 'Jalapeno' in nine-month design cycle · THIS ARTICLE
  2. 2d agoOpenAI and Broadcom have jointly unveiled a custom chip designed specifically for AI inference, mark...
  3. 1w agoQualcomm in talks to acquire Tenstorrent for up to $10 billion, gaining Jim Keller’s AI chip team. Q...Tenstorrent
  4. 2w agoGoogle to Pay SpaceX $30B for AI Compute. In a filing with the U.S. Securities and Exchange Commissi...
  5. 0mo ago**ByteDance develops custom CPUs for AI inference amid chip supply crunch**ByteDance
  6. 1mo agoNVIDIA launches HealDA, an AI-based data assimilation model for global weather analysisNVIDIA

Related News

More news from OpenAI

Stay updated with the latest news and announcements from OpenAI.

View all OpenAI news

Discover AI Startups

Explore 2,000+ AI companies with VC-grade analysis, funding data, and investment insights.

Explore Dashboard