Nous Models
Nous logo

Hermes 3 405B Instruct

405B

by Nous

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the board. Hermes 3 405B is a frontier-level, full-parameter finetune of the Llama-3.1 405B foundation model, focused on aligning LLMs to the user, with powerful steering capabilities and control given to the end user. The Hermes 3 series builds and expands on the Hermes 2 set of capabilities, including more powerful and reliable function calling and structured output capabilities, generalist assistant capabilities, and improved code generation skills. Hermes 3 is competitive, if not superior, to Llama-3.1 Instruct models at general capabilities, with varying strengths and weaknesses attributable between the two.

Chat with Hermes 3 405B Instruct
Input Price$1.00/1M tokens
Output Price$3.00/1M tokens
Intelligence17.6
Coding18.1

Specifications

Technical details and pricing.

ProviderNous
Context Window131,072 tokens
Release DateAug 27, 2025
ModalitiesText

Benchmarks

10 benchmark scores from Artificial Analysis.

GPQA53.6%
MMLU Pro72.9%
HLE4.2%
LiveCodeBench54.6%
AIME 202515.3%
SciCode34.6%
LCR20.0%
IFBench34.8%
Tau226.6%
TerminalBench Hard9.8%

Composite Indices

Intelligence, Coding, Math

Standard Benchmarks

Academic and industry benchmarks

Frequently Asked Questions

What is Hermes 3 405B Instruct good for?

Use Hermes 3 405B Instruct for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.

How much does Hermes 3 405B Instruct cost?

Pricing is based on usage. Current rates are $1.00/1M tokens for input and $3.00/1M tokens for output.

Can I try Hermes 3 405B Instruct for free?

Yes. You can start a chat instantly and test the model before deciding on a plan.

Does Hermes 3 405B Instruct support images or audio?

Hermes 3 405B Instruct focuses on text-based tasks.

Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.