DeepSeek V3.1

DeepSeek V3.1 is DeepSeek's August 21, 2025 model update introducing hybrid inference with selectable thinking and non-thinking modes in one endpoint. It strengthens tool use and multi-step agent capabilities over DeepSeek-V3.

Implicit Caching, Reasoning, Tool Use

index.ts

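import { streamText } from 'ai'

// A plain string model ID is resolved through the AI Gateway.
const result = streamText({
  model: 'deepseek/deepseek-v3.1',
  prompt: 'Why is the sky blue?'
})

// Print the response as chunks arrive.
for await (const textPart of result.textStream) {
  process.stdout.write(textPart)
}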


About DeepSeek V3.1

DeepSeek V3.1 was released August 21, 2025. Its central change is the consolidation of thinking and non-thinking inference into one model. On DeepSeek's API, the deepseek-chat identifier selects non-thinking mode and deepseek-reasoner selects thinking mode. Previously these behaviors required separate deployments. With the dual-mode design you can route each request to the inference behavior it needs without maintaining separate integrations, which simplifies agent architectures where some steps need reasoning and others don't.
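The per-step routing described above can be sketched as a small helper that picks an identifier for each agent step. This is a minimal illustration, not DeepSeek's or the AI Gateway's own code: the names AgentStep, ChatRequest, pickModel, and buildRequest are hypothetical, and the request shape assumes an OpenAI-style chat payload. Only the two identifiers, deepseek-chat and deepseek-reasoner, come from the text. The sketch builds request bodies and sends nothing.

```typescript
// Hypothetical types and helpers for illustration only.
// The two model identifiers are the ones named in the text above.
type DeepSeekModel = 'deepseek-chat' | 'deepseek-reasoner'

interface AgentStep {
  prompt: string
  needsReasoning: boolean
}

// Assumed OpenAI-style chat payload; check the provider docs for the exact schema.
interface ChatRequest {
  model: DeepSeekModel
  messages: { role: 'user'; content: string }[]
}

// Thinking mode for steps that need reasoning, non-thinking mode otherwise.
function pickModel(step: AgentStep): DeepSeekModel {
  return step.needsReasoning ? 'deepseek-reasoner' : 'deepseek-chat'
}

// Build the request body for one step without sending it.
function buildRequest(step: AgentStep): ChatRequest {
  return {
    model: pickModel(step),
    messages: [{ role: 'user', content: step.prompt }]
  }
}

const steps: AgentStep[] = [
  { prompt: 'Extract the city name from: "Flights to Oslo"', needsReasoning: false },
  { prompt: 'Plan a three-leg itinerary that minimizes layover time', needsReasoning: true }
]

const requests: ChatRequest[] = steps.map(buildRequest)
```

Because both modes sit behind the same request shape, the only thing that changes between steps is the model field, so the rest of the integration can stay shared.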

Thinking mode reaches answers in less time than DeepSeek's earlier reasoning model, DeepSeek-R1-0528, according to DeepSeek's release notes. Strict function calling is available in beta, and the API also accepts the Anthropic API format, which widens the range of existing infrastructure that can route to DeepSeek V3.1 without modification.
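As a rough sketch of what a tool definition for strict function calling might look like, the example below uses an OpenAI-style tool schema with a strict flag. The schema shape, the StrictTool type, the get_weather tool, and the allPropertiesRequired helper are all assumptions made for illustration. The feature is in beta, so consult DeepSeek's API documentation for the exact fields and for how to enable it.

```typescript
// Assumed OpenAI-style tool schema with a `strict` flag; not confirmed by the text above.
interface StrictTool {
  type: 'function'
  function: {
    name: string
    description: string
    strict: true
    parameters: {
      type: 'object'
      properties: Record<string, { type: string; description?: string }>
      required: string[]
      additionalProperties: false
    }
  }
}

// Hypothetical example tool.
const getWeather: StrictTool = {
  type: 'function',
  function: {
    name: 'get_weather',
    description: 'Look up the current weather for a city.',
    strict: true,
    parameters: {
      type: 'object',
      properties: {
        city: { type: 'string', description: 'City name, e.g. "Oslo"' },
        unit: { type: 'string', description: '"celsius" or "fahrenheit"' }
      },
      // Assumption: strict schemas list every property as required.
      required: ['city', 'unit'],
      additionalProperties: false
    }
  }
}

// Local sanity check: is every declared property also listed as required?
function allPropertiesRequired(tool: StrictTool): boolean {
  const { properties, required } = tool.function.parameters
  return Object.keys(properties).every((key) => required.includes(key))
}

const schemaLooksStrict: boolean = allPropertiesRequired(getWeather)
```

Validating the schema locally before sending it is a cheap way to catch a missing required entry, which strict modes in similar APIs typically reject.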

DeepSeek V3.1 targets stronger multi-step reasoning for complex search tasks and better performance on SWE-Bench and Terminal-Bench. It also ships with a new tokenizer and a refreshed chat template. Current AI Gateway rates appear on this page.
