DeepSeek V4 Pro

DeepSeek V4 Pro is DeepSeek's April 23, 2026 top-tier model in the V4 series. It pairs a hybrid attention architecture with a context window of 1.0M tokens and targets complex reasoning, multi-step problem solving, and agentic tasks.

ReasoningTool UseImplicit Caching

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'deepseek/deepseek-v4-pro',
  prompt: 'Why is the sky blue?'
})

Overview About Providers Throughput Latency Uptime Status Similar FAQ

About DeepSeek V4 Pro

DeepSeek V4 Pro was released April 23, 2026 as the high-capability tier of DeepSeek's V4 generation. The V4 series introduces a hybrid attention architecture that combines Compressed Sparse Attention (CSA) with Heavily Compressed Attention (HCA), and uses ManifoldConstrained Hyper-Connections (mHC) in place of standard residual connections. The combination supports efficient inference at the 1.0M tokens window.

DeepSeek V4 Pro is positioned for complex reasoning, multi-step problem solving, and agentic workflows. Tool use, reasoning, and implicit caching are all supported, so DeepSeek V4 Pro fits planner-style pipelines where the model decides on tool calls, integrates results, and iterates toward an answer. Maximum output is 1.0M tokens, which gives long-form reasoning chains and tool-call sequences room to complete in a single response.

Access is through AI Gateway with an AI Gateway API key or OIDC token. You can integrate through the AI SDK, Chat Completions, Responses, or Messages API formats. Implicit caching applies when a long input prefix repeats across calls, charging the cached input rate instead of the standard input rate for cached tokens.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

DeepSeek V4 Pro

About DeepSeek V4 Pro