LangMart: DeepSeek: R1 Distill Llama 70B

Model Overview

Property	Value
Model ID	`openrouter/deepseek/deepseek-r1-distill-llama-70b`
Name	DeepSeek: R1 Distill Llama 70B
Provider	deepseek
Released	2025-01-23

Description

DeepSeek R1 Distill Llama 70B is a distilled large language model based on Llama-3.3-70B-Instruct, using outputs from DeepSeek R1. The model combines advanced distillation techniques to achieve high performance across multiple benchmarks, including:

AIME 2024 pass@1: 70.0
MATH-500 pass@1: 94.5
CodeForces Rating: 1633

The model leverages fine-tuning from DeepSeek R1's outputs, enabling competitive performance comparable to larger frontier models.

Description

LangMart: DeepSeek: R1 Distill Llama 70B is a language model provided by deepseek. This model offers advanced capabilities for natural language processing tasks.

Provider

deepseek

Specifications

Spec	Value
Context Window	131,072 tokens
Modalities	text->text
Input Modalities	text
Output Modalities	text

Pricing

Type	Price
Input	$0.03 per 1M tokens
Output	$0.11 per 1M tokens

Capabilities

Frequency penalty
Include reasoning
Logit bias
Max tokens
Min p
Presence penalty
Reasoning
Repetition penalty
Response format
Seed
Stop
Structured outputs
Temperature
Tool choice
Tools
Top k
Top p

Detailed Analysis

DeepSeek-R1 (Reasoner) Model Analysis