Alibaba Open-Sources Qwen2.5-Omni: 7B Multimodal AI Runs on Phones
Alibaba's Qwen2.5-Omni: 7B multimodal AI for text, audio, video & speech—open-sourced, runs on phones.
"AI Disruption" Publication 5400 Subscriptions 20% Discount Offer Link.
On March 27, the Alibaba Tongyi Qianwen team released Qwen2.5-Omni.
This is a brand-new flagship multimodal large-scale model in the Qwen series, designed for comprehensive multimodal perception. It can seamlessly handle various inputs, including text, images, audio, and video, while supporting streaming text generation and natural speech synthesis output.