LangMart: Nous: DeepHermes 3 Mistral 24B Preview

Model Overview

Property	Value
Model ID	`openrouter/nousresearch/deephermes-3-mistral-24b-preview`
Name	Nous: DeepHermes 3 Mistral 24B Preview
Provider	nousresearch
Released	2025-05-09

Description

DeepHermes 3 (Mistral 24B Preview) is an instruction-tuned language model by Nous Research based on Mistral-Small-24B, designed for chat, function calling, and advanced multi-turn reasoning. It introduces a dual-mode system that toggles between intuitive chat responses and structured “deep reasoning” mode using special system prompts. Fine-tuned via distillation from R1, it supports structured output (JSON mode) and function call syntax for agent-based applications.

DeepHermes 3 supports a reasoning toggle via system prompt, allowing users to switch between fast, intuitive responses and deliberate, multi-step reasoning. When activated with the following specific system instruction, the model enters a "deep thinking" mode—generating extended chains of thought wrapped in <think></think> tags before delivering a final answer.

System Prompt: You are a deep thinking AI, you may use extremely long chains of thought to deeply consider the problem and deliberate with yourself via systematic reasoning processes to help come to a correct solution prior to answering. You should enclose your thoughts and internal monologue inside tags, and then provide your solution or response to the problem.

Description

LangMart: Nous: DeepHermes 3 Mistral 24B Preview is a language model provided by nousresearch. This model offers advanced capabilities for natural language processing tasks.

Provider

nousresearch

Specifications

Spec	Value
Context Window	32,768 tokens
Modalities	text->text
Input Modalities	text
Output Modalities	text

Pricing

Type	Price
Input	$0.02 per 1M tokens
Output	$0.10 per 1M tokens

Capabilities

Frequency penalty
Include reasoning
Max tokens
Presence penalty
Reasoning
Repetition penalty
Response format
Seed
Stop
Structured outputs
Temperature
Tool choice
Tools
Top k
Top p

Detailed Analysis

DeepHermes 3 Mistral 24B Preview is a specialized fine-tune by NousResearch that unifies two inference modes within a single model: extended reasoning chains (long chains of thought improving answer accuracy) and standard LLM response modes. This hybrid architecture enables the model to switch between fast standard responses and slower, more deliberate reasoning depending on query complexity - essentially combining the benefits of reasoning models like Magistral with the efficiency of standard models. DeepHermes 3 excels at problems requiring variable reasoning depth: simple queries get instant responses while complex problems trigger multi-step analysis. The 24B parameter base (likely Mistral Small architecture) provides strong general capabilities while the reasoning integration adds cognitive depth. The model achieves 47.3GB VRAM requirements and supports 32K context under Apache 2.0 license, enabling self-hosting for organizations requiring reasoning capabilities with local deployment. DeepHermes 3 Preview represents research into unified reasoning-response architectures, ideal for applications needing adaptive intelligence, research into reasoning model behavior, and deployments requiring both fast responses and deep reasoning without switching models. The Preview designation indicates ongoing development.