Qwen3.5-Flash
by Qwen
The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the 3 series, these models deliver a leap forward in performance for both pure text and multimodal tasks, offering fast response times while balancing inference speed and overall performance.
Specifications
Technical details and pricing.
Frequently Asked Questions
What is Qwen3.5-Flash good for?
Use Qwen3.5-Flash for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.
How much does Qwen3.5-Flash cost?
Pricing is based on usage. Current rates are $0.10/1M tokens for input and $0.40/1M tokens for output.
Can I try Qwen3.5-Flash for free?
Yes. You can start a chat instantly and test the model before deciding on a plan.
Does Qwen3.5-Flash support images or audio?
Qwen3.5-Flash can understand images.
Similar Models
Other models you might want to explore.
Pricing, context, and capability data are sourced from OpenRouter.