Skip to main content
Back to News
MiniMax M3 launch challenges closed-source frontier with Token Plan pricing and three-in-one capability
Technology
3 min read
CN

MiniMax M3 launch challenges closed-source frontier with Token Plan pricing and three-in-one capability

The AMW Read

Novelty 3 because M3 is the first open-weight model to simultaneously match closed-source leaders on long-context coding, native multimodal, and agentic workflows — overturning a named segment dynamic. Significance 2 because while impactful within the foundation model and coding segments, it does no
NoveltySignificance
Foundation Models · Case StudiesFoundation Models · Structural ForcesAI Coding · Player Map
MiniMax
MiniMax

Foundation Models / LLMs

View Company Profile

MiniMax M3 launch challenges closed-source frontier with Token Plan pricing and three-in-one capability

Chinese AI lab MiniMax has released the M3 model, a flagship foundation model that combines long-context (1M tokens), native multimodal (text+image from pretraining), and advanced coding capabilities in a single open-weight architecture. The model scores 59% on SWE-Bench Pro, surpassing GPT-5.5 and Gemini 3.1 Pro and approaching Opus 4.7. Its sparse attention mechanism (MiniMax Sparse Attention) reduces per-token computation to 1/20th of its predecessor while delivering 15x decoding speedup. Alongside the model, MiniMax launched MiniMax Code, a Claude Code competitor harness purpose-trained for M3. The company also introduced a Token Plan pricing model that initially sparked controversy but was quickly adjusted with higher weekly usage limits. Independent tests by quantumbit (量子位) showed M3 autonomously replicating an ICLR 2025 Outstanding Paper, building an interactive map of Jensen Huang's Beijing food tour, identifying all 74 logos from an Nvidia ComputeX slides, and understanding a 2-hour linguistics Olympiad tutorial video — all without human intervention.

Why it matters: M3 represents the first open-weight model to simultaneously achieve frontier-level performance on long-context coding, native multimodal reasoning, and autonomous agent workflows — a trifecta previously exclusive to closed-source models from OpenAI (GPT-5.5), Anthropic (Claude Opus 4.7), and Google (Gemini 3.1 Pro). This breaks the long-standing pattern where the most capable multi-capability models have remained behind API walls. The move signals that the open-weight ecosystem is now closing the gap on the most demanding compound-capability benchmarks, potentially accelerating developer migration toward open alternatives for production agentic workloads. The Token Plan pricing controversy and quick response also highlight the ongoing capital-compression dynamics in China's foundation-model market, where labs must balance aggressive pricing against sustainable compute cost recovery.

Grounded expert take: MiniMax's M3 validates two key substrate patterns. First, it exemplifies the fastest-ARR-ramp dynamic within the open-weight foundation model segment — MiniMax has moved from M2's capable-but-gapped coding to M3's frontier-level multi-capability stack in a single release cycle, compressing what historically took 12-18 months. Second, the combination of user-simulator-based agent training (simulating real developer collaboration cycles) and MSA sparse attention suggests MiniMax is betting on architecture-level differentiation rather than pure scale — a structural force that could reshape the moat analysis for foundation models if open-weight labs can match closed-source capability at 10x lower inference cost. The explicit targeting of the Claude Code developer experience with MiniMax Code also deepens the competition within the AI coding/DevTools segment, where product-level bundling of model + harness is becoming a new distribution moat.

#MiniMax #M3 #FoundationModels #OpenWeight #AICoding #TokenPlan #SparseAttention #ChinaAI

#MiniMax#M3#open-weight model#SWE-Bench#native multimodal#Token Plan#MiniMax Code#sparse attention

How This Connects

Based on Foundation Models · Case Studies

  1. 6h agoAnthropic releases Claude Fable 5; Microsoft restricts employee use over data retention concernsAnthropic
  2. 2d agoApple AI runs on Nvidia chips. At a WWDC 2026 tech talk, Apple disclosed that its Private Cloud Comp...
  3. 3d agoOpenAI proposes mandatory AI safety assessment framework, diverging from Trump administration's voluntary NSA-led approachOpenAI
  4. 4d agoOpenAI launched Lockdown Mode, an optional security setting that offers users enhanced protection ag...OpenAI
  5. 1w agoMiniMax M3 launch challenges closed-source frontier with Token Plan pricing and three-in-one capability · THIS ARTICLE
  6. 1mo agoMicrosoft and OpenAI renegotiate deal, ending cloud exclusivity and extending revenue share through 2032Microsoft

Related News

More news from MiniMax

Stay updated with the latest news and announcements from MiniMax.

View all MiniMax news

Discover AI Startups

Explore 2,000+ AI companies with VC-grade analysis, funding data, and investment insights.

Explore Dashboard