O

LangMart: DeepSeek: DeepSeek R1 0528 Qwen3 8B

Openrouter
33K
Context
$0.0200
Input /1M
$0.1000
Output /1M
N/A
Max Output

LangMart: DeepSeek: DeepSeek R1 0528 Qwen3 8B

Model Overview

Property Value
Model ID openrouter/deepseek/deepseek-r1-0528-qwen3-8b
Name DeepSeek: DeepSeek R1 0528 Qwen3 8B
Provider deepseek
Released 2025-05-29

Description

DeepSeek-R1-0528 is a lightly upgraded release of DeepSeek R1 that taps more compute and smarter post-training tricks, pushing its reasoning and inference to the brink of flagship models like O3 and Gemini 2.5 Pro. It now tops math, programming, and logic leaderboards, showcasing a step-change in depth-of-thought. The distilled variant, DeepSeek-R1-0528-Qwen3-8B, transfers this chain-of-thought into an 8 B-parameter form, beating standard Qwen3 8B by +10 pp and tying the 235 B “thinking” giant on AIME 2024.

Description

LangMart: DeepSeek: DeepSeek R1 0528 Qwen3 8B is a language model provided by deepseek. This model offers advanced capabilities for natural language processing tasks.

Provider

deepseek

Specifications

Spec Value
Context Window 32,768 tokens
Modalities text->text
Input Modalities text
Output Modalities text

Pricing

Type Price
Input $0.02 per 1M tokens
Output $0.10 per 1M tokens

Capabilities

  • Frequency penalty
  • Include reasoning
  • Max tokens
  • Presence penalty
  • Reasoning
  • Repetition penalty
  • Response format
  • Seed
  • Stop
  • Structured outputs
  • Temperature
  • Top k
  • Top p

Detailed Analysis

DeepSeek-R1 (Reasoner) Model Analysis