Devstral 2
Devstral 2 is a 123B open-weight agentic coding model with 72.2% on SWE-bench Verified, a context window of 256K tokens, and support for codebase exploration and multi-file orchestration.
import { streamText } from 'ai'
const result = streamText({ model: 'mistral/devstral-2', prompt: 'Why is the sky blue?'})Playground
Try out Devstral 2 by Mistral AI. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.
Providers
Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
| Provider |
|---|
P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.
P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.
Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.
More models by Mistral AI
| Model |
|---|
About Devstral 2
Devstral 2 scores 72.2% on SWE-bench Verified per Mistral AI's Devstral 2 announcement. At 123B parameters, it's the larger tier in the Devstral 2 family.
The context window of 256K tokens lets Devstral 2 hold entire repositories, long dependency chains, or extensive test suites in context simultaneously. Multi-file orchestration and dependency tracking in production codebases demand this capacity. Devstral 2 supports codebase exploration, multi-file changes, failure detection, and auto-correction within a single agent loop.
Devstral 2 ships under a modified MIT license, making it suitable for commercial deployments.
What To Consider When Choosing a Provider
- Configuration: Devstral 2 is a 123B open-weight agentic coding model available through AI Gateway, so you can access its full capability without managing model infrastructure.
- Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
- Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
When to Use Devstral 2
Best For
- Software engineering agents: Autonomous agents that resolve real GitHub issues end-to-end
- Multi-file codebase changes: Refactors and features that require deep dependency tracking across files
- Automated failure detection: Agents that run self-correction loops inside CI/CD pipelines
- Top open-weight SWE-bench scores: For teams prioritizing these benchmarks at release
Consider Alternatives When
- Smaller parameter footprint: You want a more compact model (consider Devstral Small 2)
- Multimodal agents: Image input is required (Devstral Small 2 supports images; Devstral 2 does not)
- Simple single-file generation: Your workload is not full agentic workflows
Conclusion
Devstral 2 reaches 72.2% on SWE-bench Verified. Devstral 2 combines a context window of 256K tokens and a modified MIT license that supports commercial deployment. Access Devstral 2 through AI Gateway for agentic coding workloads.
Frequently Asked Questions
What is Devstral 2's score on SWE-bench Verified?
72.2% on SWE-bench Verified, per Mistral AI's Devstral 2 announcement.
What is the context window?
256K tokens, enabling entire repositories, long test suites, and complex dependency chains to be held in context simultaneously.
What license does Devstral 2 use?
A modified MIT license that permits commercial deployments.
How does Devstral 2 differ from Devstral Small 2?
Devstral 2 (123B) outperforms Devstral Small 2 (24B) on benchmarks. Devstral Small 2 is 24B parameters, supports image inputs, and uses the Apache 2.0 license.
Can Devstral 2 handle failure detection and auto-correction?
Yes. Automated failure detection with auto-correction is a listed production workflow capability.