
Qwen AI Models

Qwen is Alibaba Cloud's model team. It builds the Qwen series of models covering text, vision, code, and audio.

Founded: 2023
Hangzhou, China
9 Models

Qwen3.5-Flash

Qwen

The Qwen3.5-Flash models are native vision-language models built on a hybrid architecture that combines a linear attention mechanism with a sparse mixture-of-experts design to improve inference efficiency.

Context: 1.0M
Speed: 283 tok/s
Input: Text, Image, Video
Output: Text
Reasoning: Yes

Qwen3.5-9B

Qwen

Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture.

Context: 262K
Speed: 141 tok/s
Input: Text, Image, Video
Output: Text
Reasoning: Yes

Qwen3.5-397B-A17B

Qwen

Qwen3.5-397B-A17B is a native vision-language model built on a hybrid architecture that combines a linear attention mechanism with a sparse mixture-of-experts design to improve inference efficiency.

Context: 262K
Speed: 53 tok/s
Input: Text, Image, Video
Output: Text
Reasoning: Yes

Qwen3.5 Plus

Qwen

The Qwen3.5 Plus models are native vision-language models built on a hybrid architecture that combines a linear attention mechanism with a sparse mixture-of-experts design to improve inference efficiency.

Context: 1.0M
Speed: 52 tok/s
Input: Text, Image, Video
Output: Text
Reasoning: Yes

Qwen3.5-35B-A3B

Qwen

Qwen3.5-35B-A3B is a native vision-language model built on a hybrid architecture that combines a linear attention mechanism with a sparse mixture-of-experts design to improve inference efficiency.

Context: 262K
Speed: 238 tok/s
Input: Text, Image, Video
Output: Text
Reasoning: Yes

Qwen3-Coder-Next

Qwen

Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows.

Context: 262K
Speed: 150 tok/s
Input: Text
Output: Text
Reasoning: No

Qwen3.5-27B

Qwen

Qwen3.5-27B is a dense native vision-language model that incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance.

Context: 262K
Speed: 241 tok/s
Input: Text, Image, Video
Output: Text
Reasoning: Yes

Qwen3.5-122B-A10B

Qwen

Qwen3.5-122B-A10B is a native vision-language model built on a hybrid architecture that combines a linear attention mechanism with a sparse mixture-of-experts design to improve inference efficiency.

Context: 262K
Speed: 160 tok/s
Input: Text, Image, Video
Output: Text
Reasoning: Yes

Qwen3-Max-Thinking

Qwen

Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning.

Context: 262K
Speed: 43 tok/s
Input: Text
Output: Text
Reasoning: Yes