Hermes 4 405B
405Bby Nous
Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research. It introduces a hybrid reasoning mode, where the model can choose to deliberate internally with <think>...</think> traces or respond directly, offering flexibility between speed and depth. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. It also supports structured outputs, including JSON mode, schema adherence, function calling, and tool use. Hermes 4 is trained for steerability, lower refusal rates, and alignment toward neutral, user-directed behavior.
Specifications
Technical details and pricing.
Benchmarks
10 benchmark scores from Artificial Analysis.
Composite Indices
Intelligence, Coding, Math
Standard Benchmarks
Academic and industry benchmarks
Frequently Asked Questions
What is Hermes 4 405B good for?
Use Hermes 4 405B for everyday tasks like writing, summarizing, brainstorming, and getting clear explanations.
How much does Hermes 4 405B cost?
Pricing is based on usage. Current rates are $1.00/1M tokens for input and $3.00/1M tokens for output.
Can I try Hermes 4 405B for free?
Yes. You can start a chat instantly and test the model before deciding on a plan.
Does Hermes 4 405B support images or audio?
Hermes 4 405B focuses on text-based tasks.
Similar Models
Other models you might want to explore.
Benchmarks and pricing are sourced from Artificial Analysis where available. OpenRouter specs are used as a fallback.