Qwen Models
Qwen logo

Qwen3 32B

32B

by Qwen

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for tasks like math, coding, and logical inference, and a "non-thinking" mode for faster, general-purpose conversation. The model demonstrates strong performance in instruction-following, agent tool use, creative writing, and multilingual tasks across 100+ languages and dialects. It natively handles 32K token contexts and can extend to 131K tokens using YaRN-based scaling.

Chat with Qwen3 32B
Input Price$0.70/1M tokens
Output Price$8.40/1M tokens
Intelligence16.5
Coding13.8

Specifications

Technical details and pricing.

ProviderQwen
Context Window40,960 tokens
Release DateApr 28, 2025
ModalitiesText

Benchmarks

12 benchmark scores from Artificial Analysis.

GPQA66.8%
MMLU Pro79.8%
HLE8.3%
LiveCodeBench54.6%
MATH 50096.1%
AIME 202573.0%
AIME80.7%
SciCode35.4%
LCR0.0%
IFBench36.3%
Tau229.8%
TerminalBench Hard3.0%

Composite Indices

Intelligence, Coding, Math

Standard Benchmarks

Academic and industry benchmarks

Frequently Asked Questions

What is Qwen3 32B good for?

Use Qwen3 32B for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.

How much does Qwen3 32B cost?

Pricing is based on usage. Current rates are $0.70/1M tokens for input and $8.40/1M tokens for output.

Can I try Qwen3 32B for free?

Yes. You can start a chat instantly and test the model before deciding on a plan.

Does Qwen3 32B support images or audio?

Qwen3 32B focuses on text-based tasks.

Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.