LLMs nach Kategorien

Top KI-Modelle nach Kategorie

Vergleiche aktuelle Modelle aus Open Source, proprietär, uncensored, Coding, Mathe, Geschwindigkeit und Neuerscheinungen.

Meistgenutzte KI-Modelle

Beliebte Modelle im aktuellen Katalog.

MoonshotAI logo

Kimi K2.7 Code

NEW

MoonshotAI

MoonshotAI: Kimi K2.7 Code is a coding-focused model in Moonshot AI's Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts. It uses a native multimodal mixture-of-experts...

Kontext 262K
Geschwindigkeit 127 tok/s
Eingabe Text, Image, Video
Ausgabe Text
Reasoning Ja
Anthropic logo

Claude Fable 5

NEW

Anthropic

Claude Fable 5 is a Mythos-class model from Anthropic, built for autonomous knowledge work and coding. It supports text, image, and file inputs with text output, with reasoning support and...

Kontext 1.0M
Geschwindigkeit 142 tok/s
Eingabe Text, Image, File
Ausgabe Text
Reasoning Ja
NVIDIA logo

Nemotron 3 Ultra

NEW

NVIDIA

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

Kontext 262K
Geschwindigkeit N/A
Eingabe Text
Ausgabe Text
Reasoning Ja
Qwen logo

Qwen3.7 Plus

NEW

Qwen

Qwen3.7-Plus is a cost-effective model in Alibaba's Qwen3.7 series. It supports text and image input with text output, building on the series' text capabilities with a comprehensive upgrade to its...

Kontext 1.0M
Geschwindigkeit 180 tok/s
Eingabe Text, Image
Ausgabe Text
Reasoning Ja
Minimax logo

M3

NEW

Minimax

MiniMax-M3 is a multimodal foundation model from MiniMax. It supports text, image, and video inputs with text output, a 1M-token context window, and is suited for long-horizon agentic work, coding,...

Kontext 524K
Geschwindigkeit 56 tok/s
Eingabe Text, Image, Video
Ausgabe Text
Reasoning Ja
Stepfun logo

Step 3.7 Flash

Stepfun

Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model. It pairs a 196B-parameter language backbone with a vision encoder for native image and video understanding, activating roughly 11B parameters...

Kontext 256K
Geschwindigkeit 403 tok/s
Eingabe Text, Image, Video
Ausgabe Text
Reasoning Ja
Anthropic logo

Claude Opus 4.8 (Fast)

Anthropic

Fast-mode variant of [Opus 4.8](/anthropic/claude-opus-4.8) - identical capabilities with higher output speed at 2x pricing relative to regular Opus 4.8. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode

Kontext 1.0M
Geschwindigkeit N/A
Eingabe Text, Image, File
Ausgabe Text
Reasoning Ja
Anthropic logo

Claude Opus 4.8

Anthropic

Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family. It supports text, image, and file inputs with text output, with reasoning support and a 1M-token...

Kontext 1.0M
Geschwindigkeit 60 tok/s
Eingabe Text, Image, File
Ausgabe Text
Reasoning Ja
Qwen logo

Qwen3.7 Max

Qwen

Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series. It supports text input and output and is designed for agent-centric workloads, with particular strengths in coding, office and productivity tasks,...

Kontext 1.0M
Geschwindigkeit 188 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja
xAI logo

Grok Build 0.1

xAI

Grok Build 0.1 is xAI’s fast coding model trained specifically for agentic software engineering workflows. It supports text and image inputs with text output, and is optimized for interactive coding...

Kontext 256K
Geschwindigkeit N/A
Eingabe Text, Image
Ausgabe Text
Reasoning Ja

Top Open-Source-KI-Modelle

Community-getrieben und transparent.

MoonshotAI logo

Kimi K2.6

MoonshotAI

Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and...

Kontext 262K
Geschwindigkeit 42 tok/s
Eingabe Text, Image
Ausgabe Text
Reasoning Ja
Xiaomi logo

MiMo-V2.5-Pro

Xiaomi

MiMo-V2.5-Pro is Xiaomi’s flagship model, delivering strong performance in general agentic capabilities, complex software engineering, and long-horizon tasks, with top rankings on benchmarks such as ClawEval, GDPVal, and SWE-bench Pro....

Kontext 1.0M
Geschwindigkeit 157 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja
Deepseek logo

V4 Pro

Deepseek

DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. It is designed for advanced reasoning, coding,...

Kontext 1.0M
Geschwindigkeit 80 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja
Z Ai logo

GLM 5.1

Z Ai

GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on...

Kontext 203K
Geschwindigkeit 83 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja
Qwen logo

Qwen3.6 Plus

Qwen

Qwen 3.6 Plus builds on a hybrid architecture that combines efficient linear attention with sparse mixture-of-experts routing, enabling strong scalability and high-performance inference. Compared to the 3.5 series, it delivers...

Kontext 1.0M
Geschwindigkeit 53 tok/s
Eingabe Text, Image, Video
Ausgabe Text
Reasoning Ja
Minimax logo

M2.7

Minimax

MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world productivity and continuous improvement. Built to actively participate in its own evolution, M2.7 integrates advanced agentic capabilities through multi-agent...

Kontext 197K
Geschwindigkeit 44 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja
Z Ai logo

GLM 5 Turbo

Z Ai

GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows...

Kontext 262K
Geschwindigkeit N/A
Eingabe Text
Ausgabe Text
Reasoning Ja
MoonshotAI logo

Kimi K2.5

MoonshotAI

Kimi K2.5 is Moonshot AI's native multimodal model, delivering state-of-the-art visual coding capability and a self-directed agent swarm paradigm. Built on Kimi K2 with continued pretraining over approximately 15T mixed...

Kontext 256K
Geschwindigkeit 43 tok/s
Eingabe Text, Image
Ausgabe Text
Reasoning Ja
Deepseek logo

V4 Flash

Deepseek

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

Kontext 1.0M
Geschwindigkeit 104 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja
Qwen logo

Qwen3.5 397B A17B

Qwen

The Qwen3.5 series 397B-A17B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. It delivers...

Kontext 262K
Geschwindigkeit 52 tok/s
Eingabe Text, Image, Video
Ausgabe Text
Reasoning Ja

Top Proprietäre KI-Modelle

Führende Closed-Source-Modelle.

Anthropic logo

Claude Fable 5

NEW

Anthropic

Claude Fable 5 is a Mythos-class model from Anthropic, built for autonomous knowledge work and coding. It supports text, image, and file inputs with text output, with reasoning support and...

Kontext 1.0M
Geschwindigkeit 142 tok/s
Eingabe Text, Image, File
Ausgabe Text
Reasoning Ja
Anthropic logo

Claude Opus 4.8

Anthropic

Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family. It supports text, image, and file inputs with text output, with reasoning support and a 1M-token...

Kontext 1.0M
Geschwindigkeit 60 tok/s
Eingabe Text, Image, File
Ausgabe Text
Reasoning Ja
OpenAI logo

GPT-5.5

OpenAI

GPT-5.5 is OpenAI’s frontier model designed for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher reliability, and improved token efficiency on hard tasks. It features a 1M+ token...

Kontext 1.1M
Geschwindigkeit 62 tok/s
Eingabe File, Image, Text
Ausgabe Text
Reasoning Ja
OpenAI logo

GPT-5.5 Pro

OpenAI

GPT-5.5 Pro is OpenAI’s high-capability model optimized for deep reasoning and accuracy on complex, high-stakes workloads. It features a 1M+ token context window (922K input, 128K output) with support for...

Kontext 1.1M
Geschwindigkeit 357 tok/s
Eingabe File, Image, Text
Ausgabe Text
Reasoning Ja
Google logo

Gemini 3.1 Pro Preview

Google

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation...

Kontext 1.0M
Geschwindigkeit 137 tok/s
Eingabe Audio, File, Image, Text, Video
Ausgabe Text
Reasoning Ja
OpenAI logo

GPT-5.4 Image 2

OpenAI

It enables rich multimodal workflows, allowing users to seamlessly move between reasoning, coding, and...

Kontext 272K
Geschwindigkeit 137 tok/s
Eingabe Image, Text, File
Ausgabe Image, Text
Reasoning Ja
Qwen logo

Qwen3.7 Max

Qwen

Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series. It supports text input and output and is designed for agent-centric workloads, with particular strengths in coding, office and productivity tasks,...

Kontext 1.0M
Geschwindigkeit 188 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja
Google logo

Gemini 3.5 Flash

Google

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...

Kontext 1.0M
Geschwindigkeit 279 tok/s
Eingabe Text, Image, Video, File, Audio
Ausgabe Text
Reasoning Ja
Minimax logo

M3

NEW

Minimax

MiniMax-M3 is a multimodal foundation model from MiniMax. It supports text, image, and video inputs with text output, a 1M-token context window, and is suited for long-horizon agentic work, coding,...

Kontext 524K
Geschwindigkeit 56 tok/s
Eingabe Text, Image, Video
Ausgabe Text
Reasoning Ja
OpenAI logo

GPT-5.3-Codex

OpenAI

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilities of GPT-5.2. It achieves state-of-the-art results...

Kontext 400K
Geschwindigkeit 194 tok/s
Eingabe Text, Image, File
Ausgabe Text
Reasoning Ja

Top Coding-KI-Modelle

Optimiert für Code und Entwickler-Workflows.

Anthropic logo

Claude Fable 5

NEW

Anthropic

Claude Fable 5 is a Mythos-class model from Anthropic, built for autonomous knowledge work and coding. It supports text, image, and file inputs with text output, with reasoning support and...

Kontext 1.0M
Geschwindigkeit 142 tok/s
Eingabe Text, Image, File
Ausgabe Text
Reasoning Ja
OpenAI logo

GPT-5.5

OpenAI

GPT-5.5 is OpenAI’s frontier model designed for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher reliability, and improved token efficiency on hard tasks. It features a 1M+ token...

Kontext 1.1M
Geschwindigkeit 62 tok/s
Eingabe File, Image, Text
Ausgabe Text
Reasoning Ja
OpenAI logo

GPT-5.5 Pro

OpenAI

GPT-5.5 Pro is OpenAI’s high-capability model optimized for deep reasoning and accuracy on complex, high-stakes workloads. It features a 1M+ token context window (922K input, 128K output) with support for...

Kontext 1.1M
Geschwindigkeit 357 tok/s
Eingabe File, Image, Text
Ausgabe Text
Reasoning Ja
OpenAI logo

GPT-5.4 Image 2

OpenAI

It enables rich multimodal workflows, allowing users to seamlessly move between reasoning, coding, and...

Kontext 272K
Geschwindigkeit 137 tok/s
Eingabe Image, Text, File
Ausgabe Image, Text
Reasoning Ja
Anthropic logo

Claude Opus 4.8

Anthropic

Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family. It supports text, image, and file inputs with text output, with reasoning support and a 1M-token...

Kontext 1.0M
Geschwindigkeit 60 tok/s
Eingabe Text, Image, File
Ausgabe Text
Reasoning Ja
Google logo

Gemini 3.1 Pro Preview

Google

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation...

Kontext 1.0M
Geschwindigkeit 137 tok/s
Eingabe Audio, File, Image, Text, Video
Ausgabe Text
Reasoning Ja
OpenAI logo

GPT-5.3-Codex

OpenAI

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilities of GPT-5.2. It achieves state-of-the-art results...

Kontext 400K
Geschwindigkeit 194 tok/s
Eingabe Text, Image, File
Ausgabe Text
Reasoning Ja
OpenAI logo

GPT-5.4 Mini

OpenAI

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding,...

Kontext 400K
Geschwindigkeit 178 tok/s
Eingabe File, Image, Text
Ausgabe Text
Reasoning Ja
Anthropic logo

Claude Sonnet 4.6

Anthropic

Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative development, complex codebase navigation, end-to-end project management with...

Kontext 1.0M
Geschwindigkeit 62 tok/s
Eingabe Text, Image, File
Ausgabe Text
Reasoning Ja
Qwen logo

Qwen3.7 Max

Qwen

Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series. It supports text input and output and is designed for agent-centric workloads, with particular strengths in coding, office and productivity tasks,...

Kontext 1.0M
Geschwindigkeit 188 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja

Top OCR-KI-Modelle

Spezialisiert auf Texterkennung und Dokument-Extraktion.

PaddlePaddle logo

PaddleOCR-VL-0.9B

PaddlePaddle

Baidu's 0.9B vision-language OCR model combining a NaViT-style dynamic-resolution encoder with ERNIE-4.5-0.3B. Handles multilingual text, tables, charts, and formulas across 16K context — optimized for efficient on-device document parsing.

Kontext 16K
Geschwindigkeit N/A
Eingabe Text, Image
Ausgabe Text
Reasoning Nein
AllenAI logo

olmOCR-2-7B

AllenAI

Allen AI's 7B OCR model fine-tuned from Qwen2.5-VL-7B on curated academic papers and technical documentation. Supports 128K context and extracts structured text from PDFs and scanned documents with high fidelity.

Kontext 128K
Geschwindigkeit N/A
Eingabe Text, Image
Ausgabe Text
Reasoning Nein
DeepSeek logo

DeepSeek-OCR

DeepSeek

DeepSeek's ~3B MoE OCR model using optical context compression to encode full pages into compact token sequences. Outputs structured Markdown preserving text layout, tables, and mathematical formulas from images and PDFs.

Kontext N/A
Geschwindigkeit N/A
Eingabe Text, Image
Ausgabe Text
Reasoning Nein
Mistral AI logo

Mistral OCR

Mistral AI

Mistral's dedicated document understanding model (December 2025). Processes PDFs and images page-by-page via API, returning structured Markdown with preserved tables, equations, image bounding boxes, and rich layout metadata.

Kontext N/A
Geschwindigkeit N/A
Eingabe Image, Pdf
Ausgabe Text
Reasoning Nein

Top Mathe-KI-Modelle

Spezialisten für Mathematik und Reasoning.

OpenAI logo

GPT-5.5 Pro

OpenAI

GPT-5.5 Pro is OpenAI’s high-capability model optimized for deep reasoning and accuracy on complex, high-stakes workloads. It features a 1M+ token context window (922K input, 128K output) with support for...

Kontext 1.1M
Geschwindigkeit 357 tok/s
Eingabe File, Image, Text
Ausgabe Text
Reasoning Ja
OpenAI logo

GPT-5.3-Codex

OpenAI

GPT-5.3-Codex is OpenAI’s most advanced agentic coding model, combining the frontier software engineering performance of GPT-5.2-Codex with the broader reasoning and professional knowledge capabilities of GPT-5.2. It achieves state-of-the-art results...

Kontext 400K
Geschwindigkeit 194 tok/s
Eingabe Text, Image, File
Ausgabe Text
Reasoning Ja
Google logo

Gemini 3.5 Flash

Google

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...

Kontext 1.0M
Geschwindigkeit 279 tok/s
Eingabe Text, Image, Video, File, Audio
Ausgabe Text
Reasoning Ja
Deepseek logo

V4 Pro

Deepseek

DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. It is designed for advanced reasoning, coding,...

Kontext 1.0M
Geschwindigkeit 80 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja
Xiaomi logo

MiMo-V2-Flash

Xiaomi

MiMo-V2-Flash is an open-source foundation language model developed by Xiaomi. It is a Mixture-of-Experts model with 309B total parameters and 15B active parameters, adopting hybrid attention architecture. MiMo-V2-Flash supports a...

Kontext 262K
Geschwindigkeit 156 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja
Google logo

Gemini 3.1 Pro Preview

Google

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation...

Kontext 1.0M
Geschwindigkeit 137 tok/s
Eingabe Audio, File, Image, Text, Video
Ausgabe Text
Reasoning Ja
Z Ai logo

GLM 4.7 Flash

Z Ai

As a 30B-class SOTA model, GLM-4.7-Flash offers a new option that balances performance and efficiency. It is further optimized for agentic coding use cases, strengthening coding capabilities, long-horizon task planning,...

Kontext 203K
Geschwindigkeit 108 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja
MoonshotAI logo

Kimi K2.7 Code

NEW

MoonshotAI

MoonshotAI: Kimi K2.7 Code is a coding-focused model in Moonshot AI's Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts. It uses a native multimodal mixture-of-experts...

Kontext 262K
Geschwindigkeit 127 tok/s
Eingabe Text, Image, Video
Ausgabe Text
Reasoning Ja
xAI logo

Grok 4.3

xAI

Grok 4.3 is a reasoning model from xAI. It accepts text and image inputs with text output, and is suited for agentic workflows, instruction-following tasks, and applications requiring high factual...

Kontext 1.0M
Geschwindigkeit 179 tok/s
Eingabe Text, Image
Ausgabe Text
Reasoning Ja
Anthropic logo

Claude Opus 4.8

Anthropic

Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family. It supports text, image, and file inputs with text output, with reasoning support and a 1M-token...

Kontext 1.0M
Geschwindigkeit 60 tok/s
Eingabe Text, Image, File
Ausgabe Text
Reasoning Ja

Schnelle KI-Modelle

Niedrige Kosten bei geringer Latenz.

Inception logo

Mercury 2

Inception

Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving...

Kontext 128K
Geschwindigkeit 1006 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja
Liquid logo

LiquidAI: LFM2-24B-A2B

Liquid

LFM2-24B-A2B is the largest model in the LFM2 family of hybrid architectures designed for efficient on-device deployment. Built as a 24B parameter Mixture-of-Experts model with only 2B active parameters per...

Kontext 33K
Geschwindigkeit 528 tok/s
Eingabe Text
Ausgabe Text
Reasoning Nein
Ibm Granite logo

IBM: Granite 4.1 8B

Ibm Granite

Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks...

Kontext 131K
Geschwindigkeit 421 tok/s
Eingabe Text
Ausgabe Text
Reasoning Nein
Stepfun logo

Step 3.7 Flash

Stepfun

Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model. It pairs a 196B-parameter language backbone with a vision encoder for native image and video understanding, activating roughly 11B parameters...

Kontext 256K
Geschwindigkeit 403 tok/s
Eingabe Text, Image, Video
Ausgabe Text
Reasoning Ja
OpenAI logo

GPT-5.5 Pro

OpenAI

GPT-5.5 Pro is OpenAI’s high-capability model optimized for deep reasoning and accuracy on complex, high-stakes workloads. It features a 1M+ token context window (922K input, 128K output) with support for...

Kontext 1.1M
Geschwindigkeit 357 tok/s
Eingabe File, Image, Text
Ausgabe Text
Reasoning Ja
Google logo

Gemini 3.1 Flash Lite

Google

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

Kontext 1.0M
Geschwindigkeit 325 tok/s
Eingabe Text, Image, Video, File, Audio
Ausgabe Text
Reasoning Ja
Google logo

Gemini 3.5 Flash

Google

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...

Kontext 1.0M
Geschwindigkeit 279 tok/s
Eingabe Text, Image, Video, File, Audio
Ausgabe Text
Reasoning Ja
Qwen logo

Qwen3.5-Flash

Qwen

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the...

Kontext 1.0M
Geschwindigkeit 259 tok/s
Eingabe Text, Image, Video
Ausgabe Text
Reasoning Ja
Minimax logo

M2.1

Minimax

MiniMax-M2.1 is a lightweight, state-of-the-art large language model optimized for coding, agentic workflows, and modern application development. With only 10 billion activated parameters, it delivers a major jump in real-world...

Kontext 197K
Geschwindigkeit 222 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja
Arcee Ai logo

Trinity Large Thinking

Arcee Ai

Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks. Launch video: https://youtu.be/Gc82AXLa0Rg?si=4RLn6WBz33qT--B7...

Kontext 262K
Geschwindigkeit 204 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja

Top Bildgenerierungs-KI-Modelle

Modelle, die aus Textprompts Bilder erzeugen.

OpenAI logo

GPT-5.4 Image 2

OpenAI

It enables rich multimodal workflows, allowing users to seamlessly move between reasoning, coding, and...

Kontext 272K
Geschwindigkeit 137 tok/s
Eingabe Image, Text, File
Ausgabe Image, Text
Reasoning Ja
Google logo

Nano Banana 2 (Gemini 3.1 Flash Image Preview)

Google

Gemini 3.1 Flash Image Preview, a.k.a. "Nano Banana 2," is Google’s latest state of the art image generation and editing model, delivering Pro-level visual quality at Flash speed. It combines...

Kontext 66K
Geschwindigkeit N/A
Eingabe Image, Text
Ausgabe Image, Text
Reasoning Ja

Top Audio-KI-Modelle

Modelle mit Sprach- und Audio-Ausgabe.

OpenAI logo

GPT Audio

OpenAI

The gpt-audio model is OpenAI's first generally available audio model. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Audio is priced...

Kontext 128K
Geschwindigkeit N/A
Eingabe Text, Audio
Ausgabe Text, Audio
Reasoning Nein
OpenAI logo

GPT Audio Mini

OpenAI

A cost-efficient version of GPT Audio. The new snapshot features an upgraded decoder for more natural sounding voices and maintains better voice consistency. Input is priced at $0.60 per million...

Kontext 128K
Geschwindigkeit 155 tok/s
Eingabe Text, Audio
Ausgabe Text, Audio
Reasoning Nein

KI-Modelle mit großem Kontextfenster

Modelle mit 200K+ Kontextfenstern.

xAI logo

Grok 4.20 Multi-Agent

xAI

Grok 4.20 Multi-Agent is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesize information...

Kontext 2.0M
Geschwindigkeit 192 tok/s
Eingabe Text, Image, File
Ausgabe Text
Reasoning Ja
OpenAI logo

GPT-5.5 Pro

OpenAI

GPT-5.5 Pro is OpenAI’s high-capability model optimized for deep reasoning and accuracy on complex, high-stakes workloads. It features a 1M+ token context window (922K input, 128K output) with support for...

Kontext 1.1M
Geschwindigkeit 357 tok/s
Eingabe File, Image, Text
Ausgabe Text
Reasoning Ja
OpenAI logo

GPT-5.5

OpenAI

GPT-5.5 is OpenAI’s frontier model designed for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher reliability, and improved token efficiency on hard tasks. It features a 1M+ token...

Kontext 1.1M
Geschwindigkeit 62 tok/s
Eingabe File, Image, Text
Ausgabe Text
Reasoning Ja
Google logo

Gemini 3.5 Flash

Google

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...

Kontext 1.0M
Geschwindigkeit 279 tok/s
Eingabe Text, Image, Video, File, Audio
Ausgabe Text
Reasoning Ja
Google logo

Gemini 3.1 Flash Lite

Google

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

Kontext 1.0M
Geschwindigkeit 325 tok/s
Eingabe Text, Image, Video, File, Audio
Ausgabe Text
Reasoning Ja
Deepseek logo

V4 Pro

Deepseek

DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. It is designed for advanced reasoning, coding,...

Kontext 1.0M
Geschwindigkeit 80 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja
Xiaomi logo

MiMo-V2.5-Pro

Xiaomi

MiMo-V2.5-Pro is Xiaomi’s flagship model, delivering strong performance in general agentic capabilities, complex software engineering, and long-horizon tasks, with top rankings on benchmarks such as ClawEval, GDPVal, and SWE-bench Pro....

Kontext 1.0M
Geschwindigkeit 157 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja
Xiaomi logo

MiMo-V2.5

Xiaomi

MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance at roughly half the inference cost, while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding...

Kontext 1.0M
Geschwindigkeit 45 tok/s
Eingabe Text, Audio, Image, Video
Ausgabe Text
Reasoning Ja
Google logo

Gemini 3.1 Pro Preview Custom Tools

Google

Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool selection behavior by preventing overuse of a general bash tool when more efficient third-party...

Kontext 1.0M
Geschwindigkeit N/A
Eingabe Text, Audio, Image, Video, File
Ausgabe Text
Reasoning Ja
Google logo

Gemini 3.1 Pro Preview

Google

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the multimodal foundation...

Kontext 1.0M
Geschwindigkeit 137 tok/s
Eingabe Audio, File, Image, Text, Video
Ausgabe Text
Reasoning Ja

Top Uncensored KI-Modelle

Leicht gefilterte Modelle mit hoher Flexibilität.

Neueste KI-Modelle

Frische Modell-Releases.

MoonshotAI logo

Kimi K2.7 Code

NEW

MoonshotAI

MoonshotAI: Kimi K2.7 Code is a coding-focused model in Moonshot AI's Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts. It uses a native multimodal mixture-of-experts...

Kontext 262K
Geschwindigkeit 127 tok/s
Eingabe Text, Image, Video
Ausgabe Text
Reasoning Ja
Anthropic logo

Claude Fable 5

NEW

Anthropic

Claude Fable 5 is a Mythos-class model from Anthropic, built for autonomous knowledge work and coding. It supports text, image, and file inputs with text output, with reasoning support and...

Kontext 1.0M
Geschwindigkeit 142 tok/s
Eingabe Text, Image, File
Ausgabe Text
Reasoning Ja
NVIDIA logo

Nemotron 3 Ultra

NEW

NVIDIA

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

Kontext 262K
Geschwindigkeit N/A
Eingabe Text
Ausgabe Text
Reasoning Ja
Qwen logo

Qwen3.7 Plus

NEW

Qwen

Qwen3.7-Plus is a cost-effective model in Alibaba's Qwen3.7 series. It supports text and image input with text output, building on the series' text capabilities with a comprehensive upgrade to its...

Kontext 1.0M
Geschwindigkeit 180 tok/s
Eingabe Text, Image
Ausgabe Text
Reasoning Ja
Minimax logo

M3

NEW

Minimax

MiniMax-M3 is a multimodal foundation model from MiniMax. It supports text, image, and video inputs with text output, a 1M-token context window, and is suited for long-horizon agentic work, coding,...

Kontext 524K
Geschwindigkeit 56 tok/s
Eingabe Text, Image, Video
Ausgabe Text
Reasoning Ja
Stepfun logo

Step 3.7 Flash

Stepfun

Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model. It pairs a 196B-parameter language backbone with a vision encoder for native image and video understanding, activating roughly 11B parameters...

Kontext 256K
Geschwindigkeit 403 tok/s
Eingabe Text, Image, Video
Ausgabe Text
Reasoning Ja
Anthropic logo

Claude Opus 4.8 (Fast)

Anthropic

Fast-mode variant of [Opus 4.8](/anthropic/claude-opus-4.8) - identical capabilities with higher output speed at 2x pricing relative to regular Opus 4.8. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode

Kontext 1.0M
Geschwindigkeit N/A
Eingabe Text, Image, File
Ausgabe Text
Reasoning Ja
Anthropic logo

Claude Opus 4.8

Anthropic

Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family. It supports text, image, and file inputs with text output, with reasoning support and a 1M-token...

Kontext 1.0M
Geschwindigkeit 60 tok/s
Eingabe Text, Image, File
Ausgabe Text
Reasoning Ja
Qwen logo

Qwen3.7 Max

Qwen

Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series. It supports text input and output and is designed for agent-centric workloads, with particular strengths in coding, office and productivity tasks,...

Kontext 1.0M
Geschwindigkeit 188 tok/s
Eingabe Text
Ausgabe Text
Reasoning Ja
xAI logo

Grok Build 0.1

xAI

Grok Build 0.1 is xAI’s fast coding model trained specifically for agentic software engineering workflows. It supports text and image inputs with text output, and is optimized for interactive coding...

Kontext 256K
Geschwindigkeit N/A
Eingabe Text, Image
Ausgabe Text
Reasoning Ja

AI chat subscription

Turn model research into daily AI work.

Use 100+ models, web search, files, and EU-hosted options in one paid chat workspace.

Inference credits

Build with EU-hosted open-source models.

OpenAI-compatible API for GLM, Kimi, DeepSeek and more. Add credits inside the dashboard.

So findest du das richtige KI-Modell

Ein praktischer Leitfaden für die Modellwahl nach Use Case.

Modell an Aufgabe anpassen

Allzweckmodelle wie GPT-4o und Claude Sonnet funktionieren für viele Aufgaben gut. Für Spezialfälle sind Coding-Modelle (DeepSeek Coder, Codestral) und Mathe-Modelle (QwQ, DeepSeek R1) oft präziser und günstiger pro Token.

Kontextfenster berücksichtigen

Für lange Dokumente, Codebasen oder lange Konversationen ist die Kontextgröße entscheidend. Modelle reichen von 8K bis über 1M Token. Größere Fenster erlauben mehr Input, erhöhen aber häufig Kosten und Latenz.

Kosten, Geschwindigkeit und Qualität balancieren

Frontier-Modelle liefern Top-Benchmarkwerte, sind aber teurer und oft langsamer. Schnelle Modelle wie Gemini Flash, Llama 3 (8B) und Mistral Small sind für hohe Last oft deutlich günstiger und reagieren schneller.

Open Source vs. proprietär

Open-Source-Modelle (Llama, Mistral, Qwen, DeepSeek) bieten Self-Hosting und Anpassbarkeit. Proprietäre Modelle (GPT-4o, Claude, Gemini) führen häufig bei Benchmarks. Viele Teams kombinieren beide Ansätze.

Multimodalität prüfen

Einige Modelle verarbeiten neben Text auch Bilder, Audio oder Dateien. Für Workflows mit Screenshots, Diagrammen oder Audio-Transkripten sind Vision-/Audio-Inputs wichtig. Strukturierte Outputs und Function Calling sind zentral für Agenten.

Benchmarks als Ausgangspunkt nutzen

GPQA, MMLU Pro und HLE messen Wissen und Reasoning. LiveCodeBench und SciCode testen Coding-Praxis. MATH 500 und AIME bewerten mathematisches Lösen. Vergleiche relevante Kategorien und teste zusätzlich mit eigenen Prompts.

Modellkatalog, Preise, Geschwindigkeit und Benchmark-Scores werden regelmäßig aktualisiert. Du kannst jedes Modell sofort im kostenlosen Chat testen.