DeepSeek V3.1 was released August 21, 2025. Its central change consolidates thinking and non-thinking inference into one model. Access non-thinking mode via the deepseek-chat API identifier and thinking mode via deepseek-reasoner. Previously these required separate deployments. The dual-mode design lets you route requests to different inference behaviors without maintaining separate integrations, simplifying agent architectures where some steps need reasoning and others don't.
The thinking mode offers improved efficiency over prior reasoning models. Strict function calling is available in beta, alongside Anthropic API format compatibility, expanding the range of infrastructure that can route to DeepSeek V3.1 without modification.
DeepSeek V3.1 targets stronger multi-step reasoning for complex search tasks, better performance on SWE-Bench and Terminal-Bench, and a new tokenizer with a refreshed chat template. Current AI Gateway rates appear on this page.