Devstral 2 is a 123B open-weight agentic coding model with 72.2% on SWE-bench Verified, a context window of 256K tokens, and support for codebase exploration and multi-file orchestration.
import { streamText } from 'ai'

// Stream a response from Devstral 2 through AI Gateway
const result = streamText({
  model: 'mistral/devstral-2',
  prompt: 'Why is the sky blue?',
})

What To Consider When Choosing a Provider
- Configuration: AI Gateway serves Devstral 2, so you can use the full 123B model without managing model infrastructure.
- Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
- Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
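The two credential types mentioned above can be sketched as a small lookup. This is only an illustration: `resolveGatewayCredential` is a hypothetical helper, and the environment variable names `AI_GATEWAY_API_KEY` and `VERCEL_OIDC_TOKEN` are assumptions that should be checked against the AI Gateway documentation.

```typescript
// Hypothetical helper: picks whichever gateway credential is present.
// The variable names AI_GATEWAY_API_KEY and VERCEL_OIDC_TOKEN are assumptions;
// confirm them against the AI Gateway documentation.
type GatewayCredential =
  | { kind: 'api-key'; token: string }
  | { kind: 'oidc'; token: string }

function resolveGatewayCredential(
  env: Record<string, string | undefined>,
): GatewayCredential {
  // Prefer an explicit API key when one is configured.
  const apiKey = env['AI_GATEWAY_API_KEY']
  if (apiKey) return { kind: 'api-key', token: apiKey }

  // Otherwise fall back to an OIDC token, if the platform provides one.
  const oidcToken = env['VERCEL_OIDC_TOKEN']
  if (oidcToken) return { kind: 'oidc', token: oidcToken }

  throw new Error('No AI Gateway credential found: set an API key or OIDC token')
}
```

In a Node.js process you would typically pass `process.env` as the argument. Either way, no Mistral provider credential appears in the lookup.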
When to Use Devstral 2
Best For
- Software engineering agents: Autonomous agents that resolve real GitHub issues end-to-end
- Multi-file codebase changes: Refactors and features that require deep dependency tracking across files
- Automated failure detection: Agents that run self-correction loops inside CI/CD pipelines
- Top open-weight SWE-bench scores: Teams that prioritize SWE-bench Verified results, where Devstral 2 scored 72.2% at release
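A self-correction loop of the kind described above might look like the following minimal sketch. It is not Devstral 2's or AI Gateway's own implementation: `selfCorrect`, `proposePatch`, and `runChecks` are hypothetical names, with the model call and the CI run injected as callbacks so the control flow stays visible.

```typescript
// Result of one verification run, e.g. a test suite or CI job.
interface CheckResult {
  passed: boolean
  log: string
}

// Hypothetical callbacks: `proposePatch` would call the model (for example
// through AI Gateway), and `runChecks` would apply the patch and run tests.
type ProposePatch = (task: string, feedback: string | null) => Promise<string>
type RunChecks = (patch: string) => Promise<CheckResult>

async function selfCorrect(
  task: string,
  proposePatch: ProposePatch,
  runChecks: RunChecks,
  maxAttempts = 3,
): Promise<{ patch: string; attempts: number } | null> {
  let feedback: string | null = null
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    const patch = await proposePatch(task, feedback)
    const result = await runChecks(patch)
    if (result.passed) return { patch, attempts: attempt }
    // Feed the failure log back so the next proposal can address it.
    feedback = result.log
  }
  // Attempts exhausted without a passing patch; the caller decides what next.
  return null
}
```

Capping the attempts keeps a failing pipeline from looping indefinitely. A `null` result is the point at which a team might hand the task to a human reviewer.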
Consider Alternatives When
- Smaller parameter footprint: You want a more compact model (consider Devstral Small 2)
- Multimodal agents: Image input is required (Devstral Small 2 supports images; Devstral 2 does not)
- Simple single-file generation: Your workload does not need full agentic workflows
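The first two criteria above can be expressed as a simple routing rule. This is a sketch under stated assumptions: `chooseDevstralModel` and the `Workload` shape are hypothetical, and the Devstral Small 2 identifier is a placeholder that should be replaced with the ID listed in the AI Gateway model catalog.

```typescript
interface Workload {
  needsImageInput: boolean
  prefersCompactModel: boolean
}

// 'mistral/devstral-2' is the ID used earlier on this page.
const DEVSTRAL_2 = 'mistral/devstral-2'
// Placeholder assumption: verify the real ID in the model catalog.
const DEVSTRAL_SMALL_2 = 'mistral/devstral-small-2'

function chooseDevstralModel(workload: Workload): string {
  // Devstral 2 does not accept image input, so image workloads go to Small 2.
  if (workload.needsImageInput) return DEVSTRAL_SMALL_2
  // A smaller parameter footprint is the other stated reason to switch.
  if (workload.prefersCompactModel) return DEVSTRAL_SMALL_2
  return DEVSTRAL_2
}
```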
Conclusion
Devstral 2 reaches 72.2% on SWE-bench Verified and pairs a 256K-token context window with a modified MIT license that supports commercial deployment. Access it through AI Gateway for agentic coding workloads.
Frequently Asked Questions
What is Devstral 2's score on SWE-bench Verified?
72.2% on SWE-bench Verified, per Mistral AI's Devstral 2 announcement.
What is the context window?
256K tokens, enabling entire repositories, long test suites, and complex dependency chains to be held in context simultaneously.
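To get a rough sense of whether a set of files fits in that window, a character-based estimate can serve as a first check. The sketch below rests on two assumptions: "256K" is read as 256,000 tokens, and about 4 characters per token is used as a heuristic instead of the model's actual tokenizer. The names `estimateTokens` and `fitsInContext` are hypothetical.

```typescript
// Assumptions: "256K" is read as 256,000 tokens, and ~4 characters per token
// is a rough heuristic, not the model's actual tokenizer.
const CONTEXT_WINDOW_TOKENS = 256_000
const CHARS_PER_TOKEN = 4

function estimateTokens(text: string): number {
  return Math.ceil(text.length / CHARS_PER_TOKEN)
}

// Returns true when the estimated input plus a reserve for the model's
// output stays within the context window.
function fitsInContext(files: string[], reservedForOutput = 8_000): boolean {
  const total = files.reduce((sum, file) => sum + estimateTokens(file), 0)
  return total + reservedForOutput <= CONTEXT_WINDOW_TOKENS
}
```

Because the ratio is only an approximation, treat a result near the limit as uncertain and leave headroom.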
What license does Devstral 2 use?
A modified MIT license that permits commercial deployments.
How does Devstral 2 differ from Devstral Small 2?
Devstral 2 (123B) outperforms Devstral Small 2 (24B) on benchmarks and uses a modified MIT license. Devstral Small 2 supports image inputs, which Devstral 2 does not, and uses the Apache 2.0 license.
Can Devstral 2 handle failure detection and auto-correction?
Yes. Automated failure detection with auto-correction is a listed production workflow capability.