DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning) by Nous Research: LLM Benchmarks, Rankings & Specs

DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning) is an instruction-tuned language model developed by Nous Research, built upon Meta's Llama 3.1 8B foundation. It represents a preview release of the DeepHermes 3 series, which is notable for being among the first open-weight models to unify traditional conversational responses with deep reasoning capabilities distilled from the DeepSeek-R1 pipeline. The "Non-reasoning" designation refers to the model's standard or "intuitive" response mode, where it generates direct answers without the visible <think> tags or extended chain-of-thought processing.

In its non-reasoning mode, the model functions as a highly efficient general-purpose assistant, optimized for speed and directness. It inherits the Hermes lineage's strengths in instruction following, creative writing, and roleplay, while benefiting from the logic and objective improvements gained during the R1 distillation process. This allows the model to handle complex instructions and nuanced dialogue more effectively than previous non-reasoning versions in the 8B parameter class.

The model supports a substantial context window of 128,000 tokens, enabling it to process extensive documents and maintain coherence over long multi-turn interactions. It utilizes the Llama-Chat prompt format to ensure better steerability and compatibility with existing tool-calling and structured-output workflows. While the full DeepHermes 3 architecture supports toggleable deep reasoning, the non-reasoning configuration is specifically tracked for tasks where immediate, high-quality outputs are preferred over long-form internal deliberation.

DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning)

Explore AI Studio

Rankings & Comparison

DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning)

Explore AI Studio

Rankings & Comparison