Qwen3 8B

Name: Qwen3 8B
Brand: Qwen
Price: 0.18 USD

by Qwen

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math, coding, and logical inference, and "non-thinking" mode for general conversation. The model is fine-tuned for instruction-following, agent integration, creative writing, and multilingual use across 100+ languages and dialects. It natively supports a 32K token context window and can extend to 131K tokens with YaRN scaling.

Chat with Qwen3 8B

Input Price$0.18/1M tokens

Output Price$2.10/1M tokens

Intelligence13.1

Coding9.0

Specifications

Technical details and pricing.

ProviderQwen

Context Window32,000 tokens

Release DateApr 28, 2025

ModalitiesText

Benchmarks

12 benchmark scores from Artificial Analysis.

GPQA58.9%

MMLU Pro74.3%

HLE4.2%

LiveCodeBench40.6%

MATH 50090.4%

AIME 202519.0%

AIME74.7%

SciCode22.6%

LCR0.0%

IFBench33.5%

Tau227.8%

TerminalBench Hard2.3%

Composite Indices

Intelligence, Coding, Math

Standard Benchmarks

Academic and industry benchmarks

Frequently Asked Questions

What is Qwen3 8B good for?

Use Qwen3 8B for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.

How much does Qwen3 8B cost?

Pricing is based on usage. Current rates are $0.18/1M tokens for input and $2.10/1M tokens for output.

Can I try Qwen3 8B for free?

Yes. You can start a chat instantly and test the model before deciding on a plan.

Does Qwen3 8B support images or audio?

Qwen3 8B focuses on text-based tasks.

Similar Models

Other models you might want to explore.

Qwen3 32B

Qwen

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue.

Details →

Qwen3 14B

Qwen

Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue.

Details →

Qwen3 Max

Qwen

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version.

Details →

Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.