AI Disruption

Alibaba Open-Sources Qwen2.5-Omni: 7B Multimodal AI Runs on Phones

Alibaba's Qwen2.5-Omni: a 7B multimodal model for text, audio, video, and speech. Open-sourced, runs on phones.

Meng Li
Mar 27, 2025

On March 27, 2025, Alibaba's Tongyi Qianwen (Qwen) team released Qwen2.5-Omni.

It is the new flagship multimodal model in the Qwen series, designed for comprehensive multimodal perception. It accepts text, image, audio, and video inputs, and it produces both streaming text and natural synthesized speech as output.
