DeepSeek V4 Pro
DeepSeek V4 Pro is the top-tier model in DeepSeek's V4 series, released April 23, 2026. It pairs a hybrid attention architecture with a 1.0M-token context window and targets complex reasoning, multi-step problem solving, and agentic tasks.
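import { streamText } from 'ai'

const result = streamText({
  model: 'deepseek/deepseek-v4-pro',
  prompt: 'Why is the sky blue?',
})

// Print the response as it streams in.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}

Frequently Asked Questions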
When should I pick DeepSeek V4 Pro over DeepSeek V4 Flash?
Pick DeepSeek V4 Pro for complex reasoning, multi-step problem solving, and agentic tool orchestration. Use DeepSeek V4 Flash for short-form instruction following, classification, and high-volume routing where per-token cost dominates.
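The split described above could be captured in a small routing helper. This is a minimal sketch, not an official API: the task labels, the `pickModel` name, and the `deepseek/deepseek-v4-flash` model ID are assumptions made for illustration (only `deepseek/deepseek-v4-pro` appears on this page).

```typescript
// Hypothetical task labels, taken from the use cases named in the answer above.
type TaskKind =
  | 'complex-reasoning'
  | 'multi-step'
  | 'agentic'
  | 'instruction-following'
  | 'classification'
  | 'high-volume-routing'

const PRO_MODEL = 'deepseek/deepseek-v4-pro'
// Assumed ID that mirrors the Pro naming pattern; verify it before use.
const FLASH_MODEL = 'deepseek/deepseek-v4-flash'

// Tasks where reasoning depth matters more than per-token cost.
const PRO_TASKS: ReadonlySet<TaskKind> = new Set<TaskKind>([
  'complex-reasoning',
  'multi-step',
  'agentic',
])

// Returns the model ID to send a task to; everything not listed above goes to Flash.
function pickModel(task: TaskKind): string {
  return PRO_TASKS.has(task) ? PRO_MODEL : FLASH_MODEL
}
```

The returned string can be passed as the `model` value in the request, so the routing decision stays in one place.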
What is the context window and max output for DeepSeek V4 Pro?
The context window is 1.0M tokens and the maximum output is 1.0M tokens.
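For request planning, a quick budget check can help. The sketch below assumes that "1.0M" means exactly 1,000,000 tokens and that prompt and output tokens share the same context window; both are assumptions, and the function names are hypothetical.

```typescript
// 1.0M tokens as stated above, taken here as exactly 1,000,000 (an assumption).
const CONTEXT_WINDOW_TOKENS = 1_000_000

// Tokens left for output once the prompt is counted; never negative.
function remainingOutputBudget(promptTokens: number): number {
  return Math.max(0, CONTEXT_WINDOW_TOKENS - promptTokens)
}

// True when the prompt plus the requested output fits in the window.
function fitsContext(promptTokens: number, requestedOutputTokens: number): boolean {
  return requestedOutputTokens <= remainingOutputBudget(promptTokens)
}
```

Under these assumptions, a 900,000-token prompt leaves room for at most 100,000 output tokens.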
What is the V4 hybrid attention architecture?
DeepSeek V4 Pro combines Compressed Sparse Attention (CSA) with Heavily Compressed Attention (HCA), and uses Manifold-Constrained Hyper-Connections (mHC) in place of standard residual connections. The combination targets efficient inference at long context.
Does DeepSeek V4 Pro support tool calls inside reasoning steps?
Yes. DeepSeek V4 Pro is tagged for reasoning and tool use, so agent pipelines can plan, call tools, integrate results, and iterate in a single call through the AI SDK or the Chat Completions, Responses, or Messages API formats.
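As an illustration of the Chat Completions format mentioned above, the following sketch builds a request body that offers the model one tool. It uses the widely documented OpenAI-style function-tool schema; the `get_weather` tool, its parameters, and the `buildToolRequest` helper are hypothetical and exist only for this example.

```typescript
// OpenAI-style function tool definition, as used by the Chat Completions format.
interface FunctionTool {
  type: 'function'
  function: {
    name: string
    description: string
    parameters: Record<string, unknown> // JSON Schema describing the arguments
  }
}

interface ChatRequest {
  model: string
  messages: { role: 'system' | 'user'; content: string }[]
  tools: FunctionTool[]
  tool_choice: 'auto'
}

// Hypothetical tool; the name and schema are placeholders.
const getWeather: FunctionTool = {
  type: 'function',
  function: {
    name: 'get_weather',
    description: 'Look up the current weather for a city.',
    parameters: {
      type: 'object',
      properties: { city: { type: 'string' } },
      required: ['city'],
    },
  },
}

// Builds the JSON body for a single-turn request that exposes the given tools.
function buildToolRequest(prompt: string, tools: FunctionTool[]): ChatRequest {
  return {
    model: 'deepseek/deepseek-v4-pro',
    messages: [{ role: 'user', content: prompt }],
    tools,
    tool_choice: 'auto', // let the model decide whether to call a tool
  }
}

const body = buildToolRequest('Do I need an umbrella in Paris today?', [getWeather])
```

In this format, a response that contains `tool_calls` is normally answered by appending the assistant message and one `role: 'tool'` message per call, then sending the conversation again. The AI SDK can run that loop for you; this sketch only shows the shape of the first request.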
How does implicit caching affect pricing for DeepSeek V4 Pro?
Repeated input prefixes (typically long system prompts) are detected automatically and charged at the cached input rate of $0.003625 per million tokens instead of the standard input rate of $0.435 per million tokens. No cache-control headers are required.
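To see what the cached rate means in practice, here is a rough cost estimate. The sketch assumes the two quoted rates are USD per one million input tokens, which is how such rates are commonly listed; confirm the unit against the model's pricing table. The `inputCostUsd` helper is hypothetical.

```typescript
// Rates from the answer above, assumed to be USD per 1M input tokens.
const INPUT_RATE_PER_M = 0.435
const CACHED_INPUT_RATE_PER_M = 0.003625

// Input cost for one request, given how many of its input tokens hit the cache.
function inputCostUsd(totalInputTokens: number, cachedTokens: number): number {
  const uncachedTokens = totalInputTokens - cachedTokens
  const cost =
    uncachedTokens * INPUT_RATE_PER_M + cachedTokens * CACHED_INPUT_RATE_PER_M
  return cost / 1_000_000
}

// 100,000 input tokens, of which an 80,000-token system prompt is cached.
const withCache = inputCostUsd(100_000, 80_000) // about $0.00899
const withoutCache = inputCostUsd(100_000, 0) // about $0.0435
```

Under these assumptions, the cached request costs roughly a fifth of the uncached one, and the saving grows with the share of the prompt that repeats.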
Do I need a DeepSeek platform account to use DeepSeek V4 Pro?
No. Access DeepSeek V4 Pro through AI Gateway with an AI Gateway API key or OIDC token.
Does DeepSeek V4 Pro support zero data retention?
Yes. Zero Data Retention is available for this model and is offered on a per-provider basis. See https://vercel.com/docs/ai-gateway/capabilities/zdr for details.