
Moonshot’s open-source Kimi K2 Thinking just surpassed GPT-5 and Claude Sonnet 4.5 in key agentic re...
The AMW Read
The release of a trillion-parameter open-weight model that outperforms proprietary SOTA (GPT-5 and Claude Sonnet 4.5) on agentic benchmarks validates the CN/OSS challenger frame and undercuts the closed-lab moat.
Foundation Models · Player Map · Scaling Laws
Moonshot’s open-source Kimi K2 Thinking has surpassed GPT-5 and Claude Sonnet 4.5 on key agentic reasoning benchmarks. On BrowseComp, the model scores 60.2% against GPT-5’s 54.9%, a lead of 5.3 percentage points. As a trillion-parameter release with no paywall, it directly challenges the proprietary SOTA models. Elite performance is no longer exclusive to closed labs, and the release accelerates the global democratization of cutting-edge AI capability.

