
Alibaba launches Qwen3.5-Omni series with omnimodal, multilingual, and captioning upgrades
Alibaba Cloud has introduced Qwen3.5-Omni, the latest entry in its large language model lineup, expanding the series with the Qwen3.5-Omni-Plus and Qwen3.5-Omni-Plus-Realtime models. Qwen3.5-Omni is positioned as the company's leading omnimodal large language model, with integrated understanding of text, image, audio, and audio-visual content. The architecture employs a Hybrid-Attention Mixture-of-Experts design for both its Thinker and Talker components, and the lineup spans instruct models at three capability tiers: Plus, Flash, and Light.
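The announcement does not detail the internals, but as a rough illustration of the Mixture-of-Experts idea behind such components, the toy PyTorch sketch below routes each token to its top-k experts. All dimensions, expert counts, and names here are placeholder assumptions, not Qwen3.5-Omni's actual design:

```python
# Toy top-k Mixture-of-Experts feed-forward layer (illustrative only;
# sizes, expert count, and routing details are placeholder assumptions).
import torch
import torch.nn as nn

class TopKMoE(nn.Module):
    def __init__(self, d_model=64, d_ff=128, n_experts=8, k=2):
        super().__init__()
        self.k = k
        self.router = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, d_ff), nn.GELU(),
                          nn.Linear(d_ff, d_model))
            for _ in range(n_experts)
        ])

    def forward(self, x):                       # x: (tokens, d_model)
        logits = self.router(x)                 # score every expert per token
        weights, idx = logits.topk(self.k, -1)  # keep only the top-k experts
        weights = weights.softmax(-1)           # normalize over the selected k
        out = torch.zeros_like(x)
        for slot in range(self.k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, slot] == e        # tokens routed to expert e
                if mask.any():
                    w = weights[mask, slot].unsqueeze(-1)
                    out[mask] += w * expert(x[mask])
        return out

print(TopKMoE()(torch.randn(10, 64)).shape)     # torch.Size([10, 64])
```

The appeal of this pattern is that only k experts run per token, so parameter count can grow without a proportional increase in per-token compute.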
Building on this foundation, the Qwen3.5-Omni models support a 256,000-token long-context input, can process more than 10 hours of audio, and handle over 400 seconds of 720p video at one frame per second. The models are pretrained on extensive multimodal datasets, including more than 100 million hours of audio-visual material, underpinning their perception and content generation across formats.
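As a small worked example of those capacity figures, a hypothetical pre-flight check might look like the following; the constants simply restate the announced limits, and the helper itself is illustrative rather than part of any Qwen API:

```python
# Hypothetical pre-flight check that restates the announced limits;
# this helper is illustrative and not part of any Qwen API.
MAX_CONTEXT_TOKENS = 256_000
MAX_AUDIO_SECONDS = 10 * 3600   # "more than 10 hours" of audio
MAX_VIDEO_SECONDS = 400         # 720p video sampled at 1 fps
VIDEO_FPS = 1                   # one frame per second, per the announcement

def fits_limits(audio_seconds: float = 0.0, video_seconds: float = 0.0) -> bool:
    """Return True if a clip stays within the stated per-request limits."""
    return (audio_seconds <= MAX_AUDIO_SECONDS
            and video_seconds <= MAX_VIDEO_SECONDS)

# A 6-minute 720p clip yields 360 frames at 1 fps and fits the budget.
print(fits_limits(video_seconds=360))  # True
print(fits_limits(video_seconds=600))  # False: exceeds the ~400 s video limit
```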
On the language front, Qwen3.5-Omni delivers major improvements, with speech recognition for 113 languages and dialects and speech generation in 36. Beyond this broader multilingual reach, Qwen3.5-Omni-Plus outperforms Gemini-3.1 Pro on audio tasks and matches its performance in audio-visual understanding. The series also features advanced captioning, producing screenplay-level descriptions with scene segmentation, timestamping, and detailed mapping of character relationships within audio content. The new models are available through both Offline and Realtime APIs.
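As a sketch of what an Offline-style call might look like, the example below uses the OpenAI-compatible endpoint through which earlier Qwen-Omni models have been served on Alibaba Cloud's DashScope; the model name and image URL are assumptions for illustration, not identifiers confirmed by this announcement:

```python
# pip install openai
import os
from openai import OpenAI

# DashScope exposes an OpenAI-compatible endpoint; earlier Qwen-Omni
# models are served this way. The model name below is assumed from the
# announcement and may differ in the actual model catalog.
client = OpenAI(
    api_key=os.environ["DASHSCOPE_API_KEY"],
    base_url="https://dashscope.aliyuncs.com/compatible-mode/v1",
)

stream = client.chat.completions.create(
    model="qwen3.5-omni-plus",  # assumed name; check the published model list
    messages=[{
        "role": "user",
        "content": [
            {"type": "image_url",
             "image_url": {"url": "https://example.com/scene.jpg"}},  # placeholder
            {"type": "text",
             "text": "Describe this scene at screenplay level, with timestamps."},
        ],
    }],
    stream=True,  # omni models on this endpoint have typically required streaming
)

for chunk in stream:
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)
```

The Realtime API would instead involve a persistent, bidirectional session for live audio in and out; its exact interface is not described in this announcement.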
