Skip to content

DeepSeek R1 0528

DeepSeek R1 0528 is DeepSeek's open-source reasoning model, released January 20, 2025. It scores 79.8% Pass@1 on AIME 2024 and 97.3% on MATH-500. Weights ship under the MIT License for commercial use.

ReasoningImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'deepseek/deepseek-r1',
prompt: 'Why is the sky blue?'
})

Playground

Try out DeepSeek R1 0528 by DeepSeek. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
DeepInfra
Legal:Terms
Privacy
160K
0.5s
32tps
$0.50/M$2.15/M
Read:$0.35/M
Write:
01/20/2025
Amazon Bedrock
Legal:Terms
Privacy
128K
0.3s
71tps
$1.35/M$5.40/M
01/20/2025
Throughput

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.

Latency

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.

Uptime

Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.

More models by DeepSeek

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
1M
1.0s
136tps
$0.14/M$0.28/M
Read:$0.0/M
Write:
deepinfra logo
deepseek logo
novita logo
04/23/2026
1M
1.0s
66tps
$0.43/M$0.87/M
Read:$0.0/M
Write:
deepinfra logo
deepseek logo
fireworks logo
+1
04/23/2026
164K
0.7s
107tps
$0.28/M$0.42/M
Read:$0.03/M
Write:
bedrock logo
deepinfra logo
deepseek logo
+2
12/01/2025
164K
0.8s
122tps
$0.28/M$0.42/M
Read:$0.03/M
Write:
bedrock logo
deepinfra logo
deepseek logo
+2
12/01/2025
131K
1.6s
28tps
$0.27/M$1.00/M
Read:$0.14/M
Write:
novita logo
09/22/2025
164K
0.3s
143tps
$0.77/M$0.77/M
Read:$0.14/M
Write:
baseten logo
novita logo
12/26/2024

About DeepSeek R1 0528

DeepSeek R1 0528 was released January 20, 2025 and breaks from conventional reasoning model training. Instead of relying on human-written reasoning traces, DeepSeek applied reinforcement learning directly to the base DeepSeek-V3 weights. Unconstrained RL let emergent behaviors like self-verification, self-reflection, and long chain-of-thought generation develop organically.

The architecture is a 671B Mixture-of-Experts (MoE) model that activates 37B parameters per forward pass. On AIME 2024, DeepSeek R1 0528 achieves 79.8% Pass@1, on par with OpenAI o1. On MATH-500 it reaches 97.3%. The release documentation also highlights strong code and general reasoning performance.

The MIT License is permissive: many proprietary reasoning models impose stricter restrictions. DeepSeek released six smaller derivatives alongside the full model. The 32B and 70B versions match OpenAI o1-mini performance, giving teams cost-efficient alternatives to the full 671B model.

What To Consider When Choosing a Provider

  • Configuration: DeepSeek R1 0528 generates verbose reasoning traces before final answers. Budget output tokens generously and account for variable response length when estimating costs.
  • Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use DeepSeek R1 0528

Best For

  • Competitive mathematics: Formal proof construction and quantitative reasoning where AIME 2024 and MATH-500 benchmark results match your task
  • Code generation and debugging: Algorithm design where RL-derived problem-solving patterns produce self-correcting chains before final output
  • Complex analytical reasoning: Multi-step reasoning in finance, science, and engineering where showing work and self-verification build trust

Consider Alternatives When

  • Conversation or summarization: Extended reasoning traces add unnecessary output token cost for content generation workloads
  • Hybrid thinking modes: DeepSeek-V3.1 or later supports both thinking and non-thinking modes through the same endpoint
  • Strict latency requirements: Variable response times from long reasoning chains are not acceptable when latency is a hard constraint
  • Pure creative writing: Structured reasoning adds no quality benefit for open-ended generation tasks

Conclusion

DeepSeek R1 0528 matches closed-source models on published benchmarks while shipping weights under the MIT License. For math, code, and formal reasoning workloads, it fits teams that need open weights.

Frequently Asked Questions

  • How was DeepSeek R1 0528 trained differently from other reasoning models?

    DeepSeek applied reinforcement learning directly to the base model, bypassing the conventional step of training on human-written reasoning traces. Reasoning patterns like self-verification and reflection emerged from RL exploration rather than curated data.

  • What are DeepSeek R1 0528's benchmark scores on mathematics?

    79.8% Pass@1 on AIME 2024, on par with OpenAI o1 at release. On MATH-500 it scores 97.3%.

  • What does the MIT License mean for using DeepSeek R1 0528 outputs commercially?

    The MIT License permits commercial use. Many proprietary reasoning models impose stricter restrictions.

  • What is the context window and architecture of DeepSeek R1 0528?

    A context window of 160K tokens. The architecture is Mixture-of-Experts (MoE) with 671B total parameters, activating 37B per forward pass.

  • When should I use DeepSeek R1 0528 versus DeepSeek-V3 or V3.1?

    DeepSeek R1 0528 specializes in deep reasoning with extended chain-of-thought. DeepSeek-V3 and later variants are general-purpose models that balance reasoning with faster, lower-cost completions and suit mixed-workload deployments better.

  • Does the reasoning trace appear in the API response?

    Yes. The chain-of-thought trace appears in the response. This helps with debugging and with applications that display the model's reasoning to end users.