Skip to main content
Back to News
Technology
2 min read
US

Thinking Machines Lab unveils full-duplex AI vision, three months after Chinese rival open-sourced similar tech

The AMW Read

The article introduces a new entrant (TML) to segment 01 and details a novel full-duplex architecture, but the core concept was already published by a Chinese lab, making this an incremental confirmation rather than a breakthrough.
NoveltySignificance
Foundation Models · Player Map

Thinking Machines Lab unveils full-duplex AI vision, three months after Chinese rival open-sourced similar tech

Thinking Machines Lab (TML), the startup founded by former OpenAI CTO Lilian Weng, published its first technical vision today: a full-duplex real-time conversational AI model that can process and generate multiple modalities simultaneously. The company released a demo video showing the model engaging in seamless, interruptible dialogue. However, the article notes that Chinese firm Shengwu Intelligence (面壁智能) released and open-sourced its MiniCPM-o 4.5 model with the same core capability—an 'Omni-Flow' framework based on time-aligned micro-turns—three months earlier in February 2026.

Why it matters: This is a textbook 'concept-engineering moat' pattern applied to the emerging full-duplex AI interaction paradigm. TML's announcement validates the technical direction pioneered by a lesser-known Chinese lab, confirming that the next frontier in human-AI interaction is the transition from turn-based (VAD) to continuous, streaming multimodal dialogue. The three-month lead and open-source delivery by Shengwu Intelligence undercuts TML's narrative leap, forcing the question of whether Western labs can sustain a first-mover advantage when Chinese competitors are willing to fully open their code. For enterprise buyers evaluating voice AI platforms, the existence of an open full-duplex baseline compresses moats built on closed demos.

Grounded expert take: The convergence on a nearly identical architecture—time-aligned micro-turn processing of audio, video, and text tokens—between two independent teams is a strong signal that this is the dominant design for real-time multimodal AI. Shengwu's earlier delivery with an Apache 2.0 license turns this into a democratic capability, not a proprietary differentiator. The article positions Shengwu as the 'true first mover,' and TML as a high-profile verifier—a dynamic that benefits the open-source ecosystem more than any single lab. The real competition now shifts from who shows the demo to who can scale reliable, low-latency inference at the edge and integrate into devices.

#FullDuplexAI #MultimodalInteraction #OpenSourceAI #ChineseAI #RealTimeVoice #ThinkingMachinesLab

#Thinking Machines Lab#full-duplex AI#Shengwu Intelligence#MiniCPM-o 4.5#real-time multimodal

How This Connects

Based on Foundation Models · Player Map

  1. 19h agoMoonshot AI and Stepfun Secure Over 30 Billion Yuan (~$4.2B) in Combined Funding in MayMoonshot AI
  2. 1d agoDeepSeek permanently reduces V4-Pro API price to promotional level, with JD.com, NetEase, and CATL a...DeepSeek
  3. 1d agoDeepSeek seeks $10B funding to pursue AGI, rivaling OpenAI with research-first strategy.DeepSeek
  4. 1d agoAnthropic nears US$30 billion funding round, surpassing OpenAI as most valuable AI startupAnthropic
  5. 1w agoDeepSeek seeking $7.4B round at $52B valuation as founder commits $2.9B personallyDeepSeek
  6. 1w agoThinking Machines Lab unveils full-duplex AI vision, three months after Chinese rival open-sourced similar tech · THIS ARTICLE

Related News

More news from Thinking Machines Lab

Stay updated with the latest news and announcements from Thinking Machines Lab.

View all Thinking Machines Lab news

Discover AI Startups

Explore 2,000+ AI companies with VC-grade analysis, funding data, and investment insights.

Explore Dashboard