Stepfun
Category: Foundation Models / LLMs
Stepfun (阶跃星辰) is a leading Chinese AI startup dedicated to developing trillion-parameter multimodal foundation models and AGI infrastructure, known as one of China's 'AI Six Little Tigers.' Stepfun was founded in 2023. The company is led by Daxin Jiang. Based in Shanghai, China. Team size: 200-500. Total funding raised: $1,300.0M. Latest round: Series B+ ($704.0M, Jan 2026). Key investors include ["Shanghai State-Owned Capital Investment","Tencent","Meituan","Qiming Venture Partners","5Y Capital","China Life Equity Investment","Pudong Venture Capital","Alibaba"].
- Founded
- 2023
- Headquarters
- Shanghai, China
- Team size
- 200-500
- Total funding
- $1,300.0M
Value proposition
Delivers native multimodal intelligence at scale, combining massive parameter counts (trillion-scale) with high inference efficiency for real-world industry applications.
Products and solutions
["Step-1 & Step-1V: 100B-parameter language and multimodal models","Step-2: 1.3T-parameter Mixture of Experts (MoE) foundation model","Step-3 Series: 321B-parameter MoE model with native multimodal reasoning","Step-Video-T2V & Step-Audio: Specialized open-source models for video generation and audio interaction","Yuewen (跃问): Consumer-facing AI personal assistant and chatbot","Maopao Ya (冒泡鸭): AI-powered character role-playing and companion app","Stepfun Open Platform: API services for developers and enterprise model customization"]
Unique value
Stepfun distinguishes itself by being one of the first in China to successfully train a trillion-parameter MoE model while maintaining 'Native Multimodality'—integrating vision, audio, and text at the architectural level rather than as separate modules.
Target customer
Enterprise clients (Smartphone OEMs, Automotive, Finance), independent developers, and individual consumers.
Industries served
["Consumer Electronics (Smartphones/IoT)","Automotive (Intelligent Cockpits)","Financial Services","Retail & E-commerce","Digital Entertainment & Content Creation"]
Technology advantage
Leverages a proprietary MFA (Attention mechanism) to reduce KV cache overhead and power-efficient MoE (Mixture of Experts) architectures. The company demonstrates a 'Scaling Law' leadership, with models like Step-3 achieving 300% higher inference efficiency on domestic chips compared to competitors like DeepSeek-R1.
How they differentiate
Stepfun differentiates through a commitment to 'Native Multimodality' and the 'Scaling Law,' being one of the first in China to successfully train a trillion-parameter Mixture of Experts (MoE) model (Step-2). Unlike competitors focusing solely on text or context length, Stepfun integrates vision, audio, and text at the architectural level and targets 'AI+Terminal' applications for automobiles and smartphones.
Main competitors
["Moonshot AI (Kimi)","Zhipu AI","MiniMax","DeepSeek"]
Key partnerships
["Geely Auto: Jointly open-sourced Step-Video and Step-Audio models for automotive applications.","OPPO & Honor: Integration of Step-series models into next-generation flagship smartphones.","Shanghai State-Owned Capital Investment: Strategic backing and resource alignment.","Mo-Chip Ecosystem Innovation Alliance: Collaboration with domestic chip manufacturers to optimize model-hardware synergy."]
Notable customers
["Geely Auto","OPPO","Honor","Huanrui Century"]
Major milestones
["Founded in April 2023 by former Microsoft VP Daxin Jiang","Launched Step-2, China's first trillion-parameter MoE model in March 2024","Reached unicorn status with $2B valuation in June 2024","Announced strategic 'AI+Terminal' partnership with Geely Auto in Feb 2025","Original Megvii CEO Yin Qi joined as Chairman in Jan 2026","Closed record-breaking 5 billion RMB B+ round in Jan 2026"]
Growth metrics
API call volume increased 45x within a 6-month period in 2024-2025; valuation reached $2 billion by mid-2024 before the massive B+ round.
Market positioning
Tier-1 Chinese Generative AI unicorn; one of the 'AI Six Little Tigers' leading China's AGI race.
Geographic focus
Mainland China (Primary), Global (Open-source and API services)
Patents and IP
Holds proprietary intellectual property in MFA Attention mechanisms, high-efficiency vision encoders (reducing visual tokens by 16x), and GUI Agent technologies (GELab-Zero).
About Daxin Jiang
Daxin Jiang (姜大昕) is a distinguished AI leader and former Corporate Vice President at Microsoft. He spent over 16 years at Microsoft, most recently as the Chief Scientist at Microsoft Asia Internet Engineering Research Institute (STCA). He led the engineering and research for Bing search, Cortana, and NLP systems for Microsoft 365 and Azure Cognitive Services. He is a globally recognized expert in data mining and NLP with over 100 research papers published.
Official website: https://www.stepfun.com/