All Models

DeepSeek AI Models

Chinese AI lab producing high-performance open-weight models. Strong in coding and mathematical reasoning.

Founded 2023Hangzhou, China10 Models Website →
DeepSeek logo

DeepSeek V3.2

DeepSeek

DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance.

Context164K
Speed48 tok/s
InputText
OutputText
ReasoningYes
DeepSeek logo

DeepSeek V3 0324

DeepSeek

DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team.

Context164K
SpeedN/A
InputText
OutputText
ReasoningNo
DeepSeek logo

DeepSeek V3.1

DeepSeek

DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes via prompt templates.

Context33K
SpeedN/A
InputText
OutputText
ReasoningYes
DeepSeek logo

DeepSeek V3.2 Exp

DeepSeek

DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediate step between V3.1 and future architectures.

Context164K
Speed47 tok/s
InputText
OutputText
ReasoningYes
DeepSeek logo

R1 0528

DeepSeek

May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1) Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens.

Context164K
SpeedN/A
InputText
OutputText
ReasoningYes
DeepSeek logo

DeepSeek V3.1 Terminus

DeepSeek

DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that maintains the model's original capabilities while addressing issues reported by users, including language consistency and agent capabilities, further optimizing the model's performance in coding and search agents.

Context164K
SpeedN/A
InputText
OutputText
ReasoningYes
DeepSeek logo

R1

DeepSeek

DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens.

Context64K
SpeedN/A
InputText
OutputText
ReasoningYes
DeepSeek logo

DeepSeek V3.2 Speciale

DeepSeek

DeepSeek-V3.2-Speciale is a high-compute variant of DeepSeek-V3.2 optimized for maximum reasoning and agentic performance.

Context164K
SpeedN/A
InputText
OutputText
ReasoningYes
DeepSeek logo

R1 Distill Qwen 32B

DeepSeek

DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1).

Context33K
Speed56 tok/s
InputText
OutputText
ReasoningYes
DeepSeek logo

R1 Distill Llama 70B

DeepSeek

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1).

Context131K
Speed57 tok/s
InputText
OutputText
ReasoningYes