Skip to main content
Back to News
Moonshot's new open-source Kimi K2 Thinking has reset the frontier, outperforming proprietary models...
Technology
1 min read
US

Moonshot's new open-source Kimi K2 Thinking has reset the frontier, outperforming proprietary models...

The AMW Read

Moonshot's K2 Thinking validates the Frame 2 debate by demonstrating that open-weight models can reset the frontier in agentic reasoning, fundamentally challenging proprietary dominance.
NoveltySignificance
Foundation Models · Player MapScaling Laws

Moonshot's new open-source Kimi K2 Thinking has reset the frontier, outperforming proprietary models in key agentic reasoning tasks. On the BrowseComp benchmark, K2 Thinking scored 60.2%, surpassing GPT-5's 54.9% and decisively beating Claude Sonnet 4.5. This high-performance open model, coupled with an API cost of just $0.15 per million input tokens, instantly democratizes top-tier agentic workflows. The LLM competition is now fundamentally a race for open source parity and real-world application cost dominance.

#AI #LLM #OpenSource #KimiK2 #AgenticAI #TechBreakthrough

How This Connects

Based on Foundation Models · Player Map

  1. 22h agoXiaomi launches MiMo-V2.5-Pro-UltraSpeed model achieving 1,000+ tokens/s throughput on general-purpose GPUsXiaomi
  2. 1d agoAnthropic releases Claude Fable 5; Microsoft restricts employee use over data retention concernsAnthropic
  3. 1w agoAlibaba (阿里巴巴) launches Qwen3.7-Plus multimodal large model. On June 2, 2026, Alibaba Cloud released...
  4. 2w ago## DeepSeek makes 75% price cut permanent on flagship modelDeepSeek
  5. 1mo agoDeepSeek V4 Preview: 1.6 Trillion Parameters, Open-Weight Challenge to Frontier LabsDeepSeek
  6. 7mo agoMoonshot's new open-source Kimi K2 Thinking has reset the frontier, outperforming proprietary models... · THIS ARTICLE

Related News

More news from Moonshot AI

Stay updated with the latest news and announcements from Moonshot AI.

View all Moonshot AI news

Discover AI Startups

Explore 2,000+ AI companies with VC-grade analysis, funding data, and investment insights.

Explore Dashboard