O

LangMart: Nous: DeepHermes 3 Mistral 24B Preview

Openrouter
33K
Context
$0.0200
Input /1M
$0.1000
Output /1M
N/A
Max Output

LangMart: Nous: DeepHermes 3 Mistral 24B Preview

Model Overview

Property Value
Model ID openrouter/nousresearch/deephermes-3-mistral-24b-preview
Name Nous: DeepHermes 3 Mistral 24B Preview
Provider nousresearch
Released 2025-05-09

Description

DeepHermes 3 (Mistral 24B Preview) is an instruction-tuned language model by Nous Research based on Mistral-Small-24B, designed for chat, function calling, and advanced multi-turn reasoning. It introduces a dual-mode system that toggles between intuitive chat responses and structured “deep reasoning” mode using special system prompts. Fine-tuned via distillation from R1, it supports structured output (JSON mode) and function call syntax for agent-based applications.

DeepHermes 3 supports a reasoning toggle via system prompt, allowing users to switch between fast, intuitive responses and deliberate, multi-step reasoning. When activated with the following specific system instruction, the model enters a "deep thinking" mode—generating extended chains of thought wrapped in <think></think> tags before delivering a final answer.

System Prompt: You are a deep thinking AI, you may use extremely long chains of thought to deeply consider the problem and deliberate with yourself via systematic reasoning processes to help come to a correct solution prior to answering. You should enclose your thoughts and internal monologue inside tags, and then provide your solution or response to the problem.

Description

LangMart: Nous: DeepHermes 3 Mistral 24B Preview is a language model provided by nousresearch. This model offers advanced capabilities for natural language processing tasks.

Provider

nousresearch

Specifications

Spec Value
Context Window 32,768 tokens
Modalities text->text
Input Modalities text
Output Modalities text

Pricing

Type Price
Input $0.02 per 1M tokens
Output $0.10 per 1M tokens

Capabilities

  • Frequency penalty
  • Include reasoning
  • Max tokens
  • Presence penalty
  • Reasoning
  • Repetition penalty
  • Response format
  • Seed
  • Stop
  • Structured outputs
  • Temperature
  • Tool choice
  • Tools
  • Top k
  • Top p

Detailed Analysis

DeepHermes 3 Mistral 24B Preview is a specialized fine-tune by NousResearch that unifies two inference modes within a single model: extended reasoning chains (long chains of thought improving answer accuracy) and standard LLM response modes. This hybrid architecture enables the model to switch between fast standard responses and slower, more deliberate reasoning depending on query complexity - essentially combining the benefits of reasoning models like Magistral with the efficiency of standard models. DeepHermes 3 excels at problems requiring variable reasoning depth: simple queries get instant responses while complex problems trigger multi-step analysis. The 24B parameter base (likely Mistral Small architecture) provides strong general capabilities while the reasoning integration adds cognitive depth. The model achieves 47.3GB VRAM requirements and supports 32K context under Apache 2.0 license, enabling self-hosting for organizations requiring reasoning capabilities with local deployment. DeepHermes 3 Preview represents research into unified reasoning-response architectures, ideal for applications needing adaptive intelligence, research into reasoning model behavior, and deployments requiring both fast responses and deep reasoning without switching models. The Preview designation indicates ongoing development.