O

LangMart: DeepSeek: R1 Distill Llama 70B

Openrouter
131K
Context
$0.0300
Input /1M
$0.1100
Output /1M
N/A
Max Output

LangMart: DeepSeek: R1 Distill Llama 70B

Model Overview

Property Value
Model ID openrouter/deepseek/deepseek-r1-distill-llama-70b
Name DeepSeek: R1 Distill Llama 70B
Provider deepseek
Released 2025-01-23

Description

DeepSeek R1 Distill Llama 70B is a distilled large language model based on Llama-3.3-70B-Instruct, using outputs from DeepSeek R1. The model combines advanced distillation techniques to achieve high performance across multiple benchmarks, including:

  • AIME 2024 pass@1: 70.0
  • MATH-500 pass@1: 94.5
  • CodeForces Rating: 1633

The model leverages fine-tuning from DeepSeek R1's outputs, enabling competitive performance comparable to larger frontier models.

Description

LangMart: DeepSeek: R1 Distill Llama 70B is a language model provided by deepseek. This model offers advanced capabilities for natural language processing tasks.

Provider

deepseek

Specifications

Spec Value
Context Window 131,072 tokens
Modalities text->text
Input Modalities text
Output Modalities text

Pricing

Type Price
Input $0.03 per 1M tokens
Output $0.11 per 1M tokens

Capabilities

  • Frequency penalty
  • Include reasoning
  • Logit bias
  • Max tokens
  • Min p
  • Presence penalty
  • Reasoning
  • Repetition penalty
  • Response format
  • Seed
  • Stop
  • Structured outputs
  • Temperature
  • Tool choice
  • Tools
  • Top k
  • Top p

Detailed Analysis

DeepSeek-R1 (Reasoner) Model Analysis