Qwen3-Omni

Native end-to-end multilingual omni-modal LLM

2025-09-23

Qwen3-Omni is a natively end-to-end omni-modal LLM developed by the Qwen team at Alibaba Cloud. It understands text, audio, images, and video, and generates speech in real time.
Open Source, Artificial Intelligence, Audio