MiMo-V2-Flash
by Xiaomi
MiMo-V2-Flash is an open-source foundation language model developed by Xiaomi. It is a Mixture-of-Experts model with 309B total parameters and 15B active parameters, adopting a hybrid attention architecture. MiMo-V2-Flash supports a hybrid-thinking toggle and a 256K context window, and excels at reasoning, coding, and agent scenarios. On SWE-bench Verified and SWE-bench Multilingual, MiMo-V2-Flash ranks as the #1 open-source model globally, delivering performance comparable to Claude Sonnet 4.5 while costing only about 3.5% as much. Users can control the reasoning behavior with the `enabled` boolean of the `reasoning` parameter.
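As a rough sketch of the hybrid-thinking toggle, the snippet below builds a chat-completion request payload that sets the `reasoning.enabled` boolean. The endpoint shape, model id string, and exact parameter nesting are assumptions (modeled on common OpenAI-compatible APIs) and may differ by provider; check your provider's docs for the actual schema.

```python
import json

def build_request(prompt: str, reasoning_enabled: bool) -> dict:
    """Assemble a chat-completion payload with the reasoning toggle.

    The "mimo-v2-flash" model id and the {"reasoning": {"enabled": ...}}
    shape are illustrative assumptions, not a confirmed API contract.
    """
    return {
        "model": "mimo-v2-flash",  # placeholder model id
        "messages": [{"role": "user", "content": prompt}],
        "reasoning": {"enabled": reasoning_enabled},
    }

# Reasoning on for a hard problem, off for a quick lookup:
payload = build_request("Explain mixture-of-experts in one paragraph.", True)
print(json.dumps(payload, indent=2))
```

Toggling `enabled` per request lets you pay the extra reasoning latency only on prompts that need it.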
Specifications
Technical details and pricing.
Benchmarks
10 benchmark scores from Artificial Analysis.
Composite Indices
Intelligence, Coding, Math
Standard Benchmarks
Academic and industry benchmarks
Frequently Asked Questions
What is MiMo-V2-Flash good for?
Use MiMo-V2-Flash for reasoning, coding, and agent workflows — its strongest areas — as well as everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.
How much does MiMo-V2-Flash cost?
Pricing is based on usage. Current rates are $0.10/1M tokens for input and $0.30/1M tokens for output.
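At those rates, estimating a per-request cost is simple arithmetic. The helper below is a minimal sketch using the published prices ($0.10/1M input, $0.30/1M output); token counts are whatever your tokenizer or API usage report gives you.

```python
# Published rates: $0.10 per 1M input tokens, $0.30 per 1M output tokens.
INPUT_RATE = 0.10 / 1_000_000   # dollars per input token
OUTPUT_RATE = 0.30 / 1_000_000  # dollars per output token

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in dollars for one request."""
    return input_tokens * INPUT_RATE + output_tokens * OUTPUT_RATE

# Example: 8,000 input tokens and 2,000 output tokens:
print(f"${estimate_cost(8_000, 2_000):.6f}")  # → $0.001400
```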
Can I try MiMo-V2-Flash for free?
Yes. You can start a chat instantly and test the model before deciding on a plan.
Does MiMo-V2-Flash support images or audio?
No. MiMo-V2-Flash focuses on text-based tasks and does not accept image or audio input.
Similar Models
Other models you might want to explore.
Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.