Nous: Hermes 4 405B
nousresearch/hermes-4-405b
Description
Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research. It introduces a hybrid reasoning mode, where the model can choose to deliberate internally with <think>...</think> traces or respond directly, offering flexibility between speed and depth. Users can control the reasoning behaviour with the reasoning enabled boolean.
The model is instruction-tuned with an expanded post-training corpus (~60B tokens) emphasizing reasoning traces, improving performance in math, code, STEM, and logical reasoning, while retaining broad assistant utility. It also supports structured outputs, including JSON mode, schema adherence, function calling, and tool use. Hermes 4 is trained for steerability, lower refusal rates, and alignment toward neutral, user-directed behavior.
How this model compares
Overall covers the full catalog. By plan covers only models available on that tier (same rules as available models in your list). Position on min–average–max. Prices use the higher of prompt or completion per token, shown per 1M tokens.
Price (per 1M tokens)
Min
Max
This model
339 models in this groupPrice (per 1M tokens)
- Min
- $0.04
- Avg
- $12.395447
- Max
- $750.00
This model: $3.00 / 1M tokens
Context length (tokens)
Min
Max
This model
339 models in this groupContext length (tokens)
- Min
- 4,095 tokens
- Avg
- 379,884.782 tokens
- Max
- 10,000,000 tokens
This model: 131,072 tokens
Capabilities
Text → TextContext: 131,072 tokens
Input:
Text
Output:
Text