Skip to content

Devstral 2

View Status

Devstral 2 is a 123B open-weight agentic coding model with 72.2% on SWE-bench Verified, a context window of 256K tokens, and support for codebase exploration and multi-file orchestration.

Tool Use
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'mistral/devstral-2',
prompt: 'Why is the sky blue?'
})

What To Consider When Choosing a Provider

  • Configuration: Devstral 2 is a 123B open-weight agentic coding model available through AI Gateway, so you can access its full capability without managing model infrastructure.
  • Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use Devstral 2

Best For

  • Software engineering agents: Autonomous agents that resolve real GitHub issues end-to-end
  • Multi-file codebase changes: Refactors and features that require deep dependency tracking across files
  • Automated failure detection: Agents that run self-correction loops inside CI/CD pipelines
  • Top open-weight SWE-bench scores: For teams prioritizing these benchmarks at release

Consider Alternatives When

  • Smaller parameter footprint: You want a more compact model (consider Devstral Small 2)
  • Multimodal agents: Image input is required (Devstral Small 2 supports images; Devstral 2 does not)
  • Simple single-file generation: Your workload is not full agentic workflows

Conclusion

Devstral 2 reaches 72.2% on SWE-bench Verified. Devstral 2 combines a context window of 256K tokens and a modified MIT license that supports commercial deployment. Access Devstral 2 through AI Gateway for agentic coding workloads.

Frequently Asked Questions

  • What is Devstral 2's score on SWE-bench Verified?

    72.2% on SWE-bench Verified, per Mistral AI's Devstral 2 announcement.

  • What is the context window?

    256K tokens, enabling entire repositories, long test suites, and complex dependency chains to be held in context simultaneously.

  • What license does Devstral 2 use?

    A modified MIT license that permits commercial deployments.

  • How does Devstral 2 differ from Devstral Small 2?

    Devstral 2 (123B) outperforms Devstral Small 2 (24B) on benchmarks. Devstral Small 2 is 24B parameters, supports image inputs, and uses the Apache 2.0 license.

  • Can Devstral 2 handle failure detection and auto-correction?

    Yes. Automated failure detection with auto-correction is a listed production workflow capability.