ZhiXiang Future (智象未来) releases 200B+ parameter image model HiDream-O1-Image-Pro, raises another round.

At its first Open Day in Beijing on May 19, 2026, ZhiXiang Future (智象未来) unveiled HiDream-O1-Image-Pro, a closed-source image foundation model with over 200 billion parameters built on its proprietary Unified Transformer (UiT) architecture. The company claims the model achieves new state-of-the-art results on multiple benchmarks spanning text-to-image, high-fidelity text rendering, diverse scene generation, and image editing. Separately, ZhiXiang Future announced a new funding round from investors including Shenzhen Capital Group (深创投), GP Capital (金浦投资), Caixin Capital (财鑫资本), and Fuju Capital (复聚资本), completing its second round of financing within two weeks, following a previously disclosed RMB 500M+ (~$70M) round.

Why it matters: This release exemplifies the capital-acceleration pattern seen in Chinese AI scale-ups, where companies are compressing multiple funding events into short windows to sustain capital-intensive model development. The HiDream-O1-Image-Pro bridges two structural trends: the industry-wide transition from U-Net to diffusion Transformer (DiT) architectures for image generation, and the push toward native multimodal models that jointly encode pixels, text tokens, and task conditions in a shared latent space—potentially differentiating ZhiXiang Future from the fragmented-encoder approaches used by many peers. The company's claim that native multimodality is a path to world models places this product at the center of an open debate about whether visual generation engines can evolve into agents that understand physical causality.

Grounded expert take: ZhiXiang Future's rapid funding cadence and simultaneous product release signal that Chinese generative media startups are entering a consolidation phase where scale of capital, not just technical demo, determines competitive positioning. The 200B-parameter closed-source tier positions the company against Zhipu AI and ByteDance's visual generation efforts, while the earlier open-source 8B-parameter HiDream-O1-Image served as a proof of architecture. The key unresolved risk is whether the native multimodal approach yields a commercially defensible moat versus the modular encoder stacks used by incumbents—a question that will be tested by enterprise adoption in the marketing, film, and short-video use cases ZhiXiang Future is prioritizing through its agent products HiBurst, ZhenZan (帧赞), and vivago.

#ZhiXiangFuture #HiDream #NativeMultimodal #ImageGeneration #ChineseAI #FoundationModels #GenerativeMedia

ZhiXiang Future (智象未来) releases 200B+ parameter image model HiDream-O1-Image-Pro, raises another round.

The AMW Read

How This Connects

More news from HiDream.ai

Discover AI Startups