DeepSeek R1 0528

DeepSeek R1 0528 is the May 28, 2025 update to DeepSeek-R1, DeepSeek's open-source reasoning model first released January 20, 2025. The original R1 release scored 79.8% Pass@1 on AIME 2024 and 97.3% on MATH-500. Weights ship under the MIT License for commercial use.

Reasoning · Implicit Caching
index.ts
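import { streamText } from 'ai'

const result = streamText({
  model: 'deepseek/deepseek-r1',
  prompt: 'Why is the sky blue?'
})

// Print the response as it streams in
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}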

Frequently Asked Questions

  • How was DeepSeek R1 0528 trained differently from other reasoning models?

    DeepSeek applied reinforcement learning directly to the base model, bypassing the conventional step of training on human-written reasoning traces. Reasoning patterns like self-verification and reflection emerged from RL exploration rather than curated data.

  • What are DeepSeek R1 0528's benchmark scores on mathematics?

    The scores published with the original DeepSeek-R1 release are 79.8% Pass@1 on AIME 2024, on par with OpenAI o1 at that time, and 97.3% on MATH-500.

  • What does the MIT License mean for using DeepSeek R1 0528 outputs commercially?

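    The MIT License permits commercial use, modification, and redistribution of the weights, provided the copyright and license notice is retained. Many proprietary reasoning models, by contrast, restrict how their outputs may be used.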

  • What is the context window and architecture of DeepSeek R1 0528?

    DeepSeek R1 0528 has a context window of 160K tokens. The architecture is Mixture-of-Experts (MoE) with 671B total parameters, of which 37B are activated per token.

  • When should I use DeepSeek R1 0528 versus DeepSeek-V3 or V3.1?

    DeepSeek R1 0528 specializes in deep reasoning with extended chain-of-thought. DeepSeek-V3 and later variants are general-purpose models that balance reasoning with faster, lower-cost completions and suit mixed-workload deployments better.

  • Does the reasoning trace appear in the API response?

    Yes. The chain-of-thought trace appears in the response. This helps with debugging and with applications that display the model's reasoning to end users.
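
As a rough illustration of working with the reasoning trace, the sketch below separates the trace from the final answer. It assumes the raw completion text wraps the trace in <think>…</think> tags, which is how the open-weight DeepSeek-R1 models emit it. Hosted APIs may instead return the trace in a separate field, so check the response shape before relying on this. The helper name splitReasoning and the SplitCompletion type are hypothetical and not part of any SDK.

```typescript
// Hypothetical helper, not part of any SDK.
// Assumption: the raw text wraps the reasoning trace in <think>...</think>.
interface SplitCompletion {
  reasoning: string
  answer: string
}

const OPEN_TAG = '<think>'
const CLOSE_TAG = '</think>'

function splitReasoning(raw: string): SplitCompletion {
  const end = raw.indexOf(CLOSE_TAG)
  // No closing tag: treat the whole text as the answer.
  if (end === -1) {
    return { reasoning: '', answer: raw.trim() }
  }
  const start = raw.indexOf(OPEN_TAG)
  // Some chat templates pre-fill the opening tag, so it may be missing
  // from the generated text; in that case the trace starts at index 0.
  const reasoningStart = start !== -1 && start < end ? start + OPEN_TAG.length : 0
  return {
    reasoning: raw.slice(reasoningStart, end).trim(),
    answer: raw.slice(end + CLOSE_TAG.length).trim()
  }
}

// Usage with a made-up completion string
const sample =
  '<think>Sunlight scatters off air molecules; blue scatters most.</think>Rayleigh scattering.'
const split = splitReasoning(sample)
console.log('reasoning:', split.reasoning)
console.log('answer:', split.answer)
```

Keeping the two parts separate makes it straightforward to log the trace for debugging while showing end users only the answer, or to render the trace in a collapsible panel.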