Question 1

How do I enable the context window of 1M tokens for Claude Sonnet 4 on AI Gateway?

Accepted Answer

Add the `anthropic-beta: context-1m-2025-08-07` header to your request. Under `providerOptions.gateway`, set `only` to `['anthropic']` so the request routes through the Anthropic provider, which supports the feature.

Question 2

What does the context window of 1M tokens enable in practice?

Accepted Answer

The context window of 1M tokens lets you process entire codebases, long documents, or extended conversation histories in a single request. This is particularly useful for code review across multiple files, document analysis, and agentic workflows that accumulate context over many steps.

Question 3

How did Claude Sonnet 4 perform on SWE-bench Verified?

Accepted Answer

72.7% on SWE-bench Verified, matching or exceeding Claude Opus 4's 72.5% on that specific benchmark.

Question 4

What is enhanced steerability in Claude Sonnet 4?

Accepted Answer

Sonnet 4 responds more precisely to instructions, reducing misinterpretation of complex or nuanced prompts. Anthropic highlighted steerability as an explicit design improvement for applications where exact specification of behavior matters.

Question 5

Does Claude Sonnet 4 support extended thinking?

Accepted Answer

Yes. Sonnet 4 is a hybrid model that supports both near-instant responses and extended thinking. Extended thinking with tool use, where the model alternates between reasoning and calling tools, is also available in beta.

Question 6

What is 1-hour prompt caching and does Sonnet 4 support it?

Accepted Answer

Yes. The Claude 4 launch introduced one-hour prompt caching as a new API capability, compared to shorter-lived caching in previous generations. This is particularly useful for codebases or large system prompts that appear in many requests.

Question 7

Why would I use Sonnet 4 instead of Opus 4 given the SWE-bench scores are similar?

Accepted Answer

Claude Sonnet 4 is priced at the Sonnet tier, while Opus 4 is priced at the Opus tier. When benchmark results are comparable, the cost gap determines the choice at scale. Check the pricing panel on this page for current rates.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Claude Sonnet 4

Frequently Asked Questions