Skip to main content
Back to News
Alibaba (阿里巴巴) launches Qwen3.7-Plus multimodal large model. On June 2, 2026, Alibaba Cloud released...
Product
2 min read
CN

Alibaba (阿里巴巴) launches Qwen3.7-Plus multimodal large model. On June 2, 2026, Alibaba Cloud released...

The AMW Read

Novelty=1: Alibaba continues its established pattern of releasing Qwen-series updated models; significant=2: multimodal agent capability and Vision Arena ranking represent a segment-level update for Chinese foundation-model competition.
NoveltySignificance
Foundation Models · Player MapFoundation Models · Case StudiesScaling Laws

Alibaba (阿里巴巴) launches Qwen3.7-Plus multimodal large model. On June 2, 2026, Alibaba Cloud released Qwen3.7-Plus, its latest multimodal foundation model with integrated text and vision capabilities. The model ranks in the global top five on the Vision Arena visual model leaderboard and first among Chinese models. Qwen3.7-Plus unifies image, video, screen, web page, and text understanding within a single agentic workflow, performing deep reasoning, code generation, tool calling, testing, and iterative self-correction. It can replicate mobile apps and desktop software directly from visual references. The model is available through Alibaba Cloud's Bailian platform via API and on Qwen Studio (chat.qwen.ai).

Why it matters: Qwen3.7-Plus exemplifies the emerging "multimodal agent" pattern where foundation models evolve beyond pure text reasoning into unified perception-action systems. This move by Alibaba directly updates the hyperscaler-distribution moat pattern — by bundling the model with its Bailian enterprise cloud platform, Alibaba replicates the strategy of top Chinese and U.S. labs converging on agent-based model delivery. The release also continues Alibaba's aggressive open-weight strategy (following Qwen3.6 in April 2026) which pressures the global foundation-model segment by compressing the pricing and capability gap between Chinese and Western frontier models.

Grounded expert take: With Qwen3.7-Plus, Alibaba demonstrates that the frontier for Chinese foundation-model labs has shifted from text benchmarks to multimodal agent performance. The "one-click replicate desktop software" capability signals increased pressure on both Western frontier model labs (OpenAI, Anthropic, Google) and specialized agent infrastructure players (Cursor, Windsurf). For enterprise buyers, this accelerates the timeline for viable AI agents that can replace traditional GUI-based workflows, though the reliability of multi-step desktop automation at scale remains unproven.

#Qwen3.7 #Alibaba #MultimodalAI #AIAgents #Wechat #FoundationModels #VisionModels

#Alibaba Cloud#Qwen3.7-Plus#multimodal AI#AI agents#vision language model#foundation models#China AI
Read Original

How This Connects

Based on Foundation Models · Player Map

  1. 1d agoOpenAI proposes mandatory AI safety assessment framework, diverging from Trump administration's voluntary NSA-led approachOpenAI
  2. 4d agoDeepSeek in talks to raise $7 billion from Tencent, CATL and other investorsDeepSeek
  3. 6d agoAlibaba (阿里巴巴) launches Qwen3.7-Plus multimodal large model. On June 2, 2026, Alibaba Cloud released... · THIS ARTICLE
  4. 1w ago## DeepSeek makes 75% price cut permanent on flagship modelDeepSeek
  5. 1mo agoDeepSeek V4 Preview: 1.6 Trillion Parameters, Open-Weight Challenge to Frontier LabsDeepSeek
  6. 1mo agoRecursive Superintelligence Secures $500 Million Investment at $4 Billion ValuationRecursive Superintelligence

Related News

Discover AI Startups

Explore 2,000+ AI companies with VC-grade analysis, funding data, and investment insights.

Explore Dashboard