
The AMW Read
The introduction of a 'reasoning diffusion' architecture (dLLM) challenges the transformer-centric scaling paradigm and fundamentally shifts the inference economics of the Foundation Model segment.
Inception Labs has launched Mercury 2, the first reasoning diffusion large language model (dLLM). It achieves 1,009 tokens per second on NVIDIA Blackwell GPUs, 5x faster than leading speed-optimized LLMs, while running on standard hardware and priced at just $0.25 per million input tokens. Founded by Stanford professor Stefano Ermon, co-inventor of the diffusion methods powering Midjourney and Stable Diffusion, Inception applies parallel token generation to text in place of traditional sequential decoding. This architectural breakthrough could fundamentally reshape AI inference economics, enabling real-time applications such as voice assistants and coding tools without custom silicon, and signaling a shift from scaling transformer architectures to reimagining model design.
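To make the speed argument concrete, here is a toy sketch of why parallel generation changes the cost model. This is not Mercury's actual algorithm (which is not public in detail); it is a minimal illustration, with hard-coded token choices, of how an autoregressive decoder pays one sequential step per token while a diffusion-style decoder fills many masked positions per denoising round:

```python
# Toy contrast: autoregressive decoding (one token per forward pass) vs.
# diffusion-style parallel denoising (a batch of positions filled per round).
# Illustrative only -- token selections are hard-coded, no model is involved.

MASK = "[MASK]"

def autoregressive_decode(target):
    """Generate left-to-right: each token costs one sequential step."""
    out, steps = [], 0
    for tok in target:
        out.append(tok)
        steps += 1
    return out, steps

def diffusion_decode(target, rounds=3):
    """Start fully masked; each round reveals a subset of positions in parallel."""
    seq = [MASK] * len(target)
    steps = 0
    for r in range(rounds):
        # a real dLLM predicts every position at once each round; here we
        # deterministically reveal an interleaved subset per round
        for i in range(r, len(target), rounds):
            seq[i] = target[i]
        steps += 1  # one parallel step per round, not one per token
    return seq, steps

target = ["Diffusion", "models", "generate", "tokens", "in", "parallel"]
ar_out, ar_steps = autoregressive_decode(target)
df_out, df_steps = diffusion_decode(target)
print(ar_steps)  # 6 sequential steps (one per token)
print(df_steps)  # 3 parallel steps, independent of sequence length
```

The step count for the diffusion sketch depends on the number of denoising rounds rather than the output length, which is the intuition behind the throughput gains claimed for dLLMs on parallel hardware like Blackwell GPUs.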
