Qwen AI Models

Alibaba Cloud's model team. Builds the Qwen series covering text, vision, code, and audio.

Founded2023Hangzhou, China9 ModelsWebsite →

Qwen3.5-Flash

Qwen

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency.

Context1.0M

Speed283 tok/s

InputText, Image, Video

OutputText

ReasoningYes

Details →

Qwen3.5-9B

Qwen

Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture.

Context262K

Speed143 tok/s

InputText, Image, Video

OutputText

ReasoningYes

Details →

Qwen3.5 397B A17B

Qwen

The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency.

Context262K

Speed53 tok/s

InputText, Image, Video

OutputText

ReasoningYes

Details →

Qwen3.5 Plus

Qwen

The Qwen3.5 native vision-language series Plus models are built on a hybrid architecture that integrates linear attention mechanisms with sparse mixture-of-experts models, achieving higher inference efficiency.

Context1.0M

Speed52 tok/s

InputText, Image, Video

OutputText

ReasoningYes

Details →

Qwen3.5-35B-A3B

Qwen

The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates linear attention mechanisms and a sparse mixture-of-experts model, achieving higher inference efficiency.

Context262K

Speed239 tok/s

InputText, Image, Video

OutputText

ReasoningYes

Details →

Qwen3 Coder Next

Qwen

Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows.

Context262K

Speed150 tok/s

InputText

OutputText

ReasoningNo

Details →

Qwen3.5-27B

Qwen

The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance.

Context262K

Speed242 tok/s

InputText, Image, Video

OutputText

ReasoningYes

Details →

Qwen3.5-122B-A10B

Qwen

The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency.

Context262K

Speed162 tok/s

InputText, Image, Video

OutputText

ReasoningYes

Details →

Qwen3 Max Thinking

Qwen

Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning.

Context262K

Speed45 tok/s

InputText

OutputText

ReasoningYes

Details →