LangMart: DeepSeek: DeepSeek R1 0528 Qwen3 8B
Model Overview
| Property | Value |
|---|---|
| Model ID | openrouter/deepseek/deepseek-r1-0528-qwen3-8b |
| Name | DeepSeek: DeepSeek R1 0528 Qwen3 8B |
| Provider | deepseek |
| Released | 2025-05-29 |
Description
DeepSeek-R1-0528 is a lightly upgraded release of DeepSeek R1 that taps more compute and smarter post-training tricks, pushing its reasoning and inference to the brink of flagship models like O3 and Gemini 2.5 Pro. It now tops math, programming, and logic leaderboards, showcasing a step-change in depth-of-thought. The distilled variant, DeepSeek-R1-0528-Qwen3-8B, transfers this chain-of-thought into an 8 B-parameter form, beating standard Qwen3 8B by +10 pp and tying the 235 B “thinking” giant on AIME 2024.
Description
LangMart: DeepSeek: DeepSeek R1 0528 Qwen3 8B is a language model provided by deepseek. This model offers advanced capabilities for natural language processing tasks.
Provider
deepseek
Specifications
| Spec | Value |
|---|---|
| Context Window | 32,768 tokens |
| Modalities | text->text |
| Input Modalities | text |
| Output Modalities | text |
Pricing
| Type | Price |
|---|---|
| Input | $0.02 per 1M tokens |
| Output | $0.10 per 1M tokens |
Capabilities
- Frequency penalty
- Include reasoning
- Max tokens
- Presence penalty
- Reasoning
- Repetition penalty
- Response format
- Seed
- Stop
- Structured outputs
- Temperature
- Top k
- Top p