What is adaptive thinking and how does it differ from extended thinking?

Adaptive thinking (`thinking: { type: 'adaptive' }`) lets the model decide when and how much to reason internally. Unlike extended thinking, you don't set a fixed thinking token budget. The model calibrates reasoning depth per request.
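
As a rough illustration of the difference, the two configurations can be modeled as a TypeScript union. Only `{ type: 'adaptive' }` comes from the answer above. The `enabled` shape with `budgetTokens` is an assumption based on the AI SDK's Anthropic provider convention for extended thinking, and the type and function names are hypothetical.

```typescript
// Hypothetical types for illustration. Only `{ type: 'adaptive' }` appears in the text above;
// the `enabled` + `budgetTokens` shape is an assumed form of an extended thinking config.
type ExtendedThinking = { type: 'enabled'; budgetTokens: number }
type AdaptiveThinking = { type: 'adaptive' }
type ThinkingConfig = ExtendedThinking | AdaptiveThinking

// Describe who controls reasoning depth for a given configuration.
function describeThinking(config: ThinkingConfig): string {
  if (config.type === 'adaptive') {
    return 'model decides when and how much to reason'
  }
  return `caller caps reasoning at ${config.budgetTokens} tokens`
}

const extended: ThinkingConfig = { type: 'enabled', budgetTokens: 10000 }
const adaptive: ThinkingConfig = { type: 'adaptive' }

console.log(describeThinking(extended))
console.log(describeThinking(adaptive))
```

The adaptive variant carries no budget field at all, which is the practical difference: there is nothing for the caller to tune per request.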

Why is the 1M-token context window significant for Opus 4.6?

Opus 4.6 is the first Opus model to support a 1M-token context window. Previous Opus models were limited to 200K tokens. The larger window enables Opus-level analysis of entire large codebases, extensive document collections, or long agent histories in a single request.
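
To make the two limits concrete, here is a minimal sketch that checks whether an input of a given size would fit. The limits come from the answer above. The 4-characters-per-token ratio is a coarse assumption, not a real tokenizer, and the function names are hypothetical.

```typescript
// Context limits taken from the answer above.
const OPUS_4_6_CONTEXT_TOKENS = 1000000
const PREVIOUS_OPUS_CONTEXT_TOKENS = 200000

// Assumption: roughly 4 characters per token. Real token counts vary by content.
const ASSUMED_CHARS_PER_TOKEN = 4

// Estimate the token count of a string using the assumed ratio.
function estimateTokens(text: string): number {
  return Math.ceil(text.length / ASSUMED_CHARS_PER_TOKEN)
}

// Check whether an estimated token count fits within a context limit.
function fitsInContext(estimatedTokens: number, contextLimit: number): boolean {
  return estimatedTokens <= contextLimit
}

// A hypothetical 2,000,000-character codebase dump (about 500,000 estimated tokens).
const codebase = 'x'.repeat(2000000)
const tokens = estimateTokens(codebase)

console.log(fitsInContext(tokens, PREVIOUS_OPUS_CONTEXT_TOKENS)) // false
console.log(fitsInContext(tokens, OPUS_4_6_CONTEXT_TOKENS)) // true
```

In practice the prompt, any tool definitions, and the generated output all share the window, so a real check would leave headroom rather than compare against the full limit.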

Can Opus 4.6 interleave thinking and tool calls in one response?

Yes. Claude Opus 4.6 can interleave thinking and tool calls within a single response: it can reason about a problem, call a tool, reason about the result, and call another tool, all in one turn.

How do I configure adaptive thinking and effort together in the AI SDK?

Under `providerOptions.anthropic` in the AI SDK, set `thinking.type` to `adaptive` and pass an `effort` level (for example, `max`). Set the model to `anthropic/claude-opus-4.6`.
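
The configuration described above might look like the following sketch. It builds the request as a plain object so it runs without any packages installed; in a real call the object would be passed to `streamText` from the `ai` package. The `buildAdaptiveRequest` helper and the `AdaptiveRequest` type are hypothetical, and the effort levels other than `max` are assumptions, since only `max` is named in the text.

```typescript
// Assumption: only 'max' is named in the text; the other levels are illustrative.
type Effort = 'low' | 'medium' | 'high' | 'max'

// Hypothetical shape of the request options described in the answer above.
interface AdaptiveRequest {
  model: string
  prompt: string
  providerOptions: {
    anthropic: {
      thinking: { type: 'adaptive' }
      effort: Effort
    }
  }
}

// Hypothetical helper that sets adaptive thinking and an effort level together.
function buildAdaptiveRequest(prompt: string, effort: Effort): AdaptiveRequest {
  return {
    model: 'anthropic/claude-opus-4.6',
    prompt,
    providerOptions: {
      anthropic: {
        thinking: { type: 'adaptive' },
        effort,
      },
    },
  }
}

const request = buildAdaptiveRequest('Summarize the trade-offs of optimistic locking.', 'max')
console.log(JSON.stringify(request.providerOptions.anthropic))
```

Keeping `thinking` and `effort` as sibling fields mirrors the description: both live under `providerOptions.anthropic`, and neither is nested inside the other.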

Does the effort parameter work the same way in Opus 4.6 as in Opus 4.5?

Yes. The effort parameter controls overall token usage across all token types, defaults to a high level, and operates independently of the thinking configuration, so the two can be set separately.

What real-world work is Opus 4.6 specifically designed to handle?

Programming, analysis, and creative tasks across the development lifecycle. The hybrid reasoning architecture lets you use extended thinking for complex problems while keeping latency low for straightforward queries.

Claude Opus 4.6

Claude Opus 4.6 is the first Opus model with a 1M-token context window. It introduces adaptive thinking, in which the model decides its own reasoning depth, supports interleaved thinking and tool calls in a single response, and delivers performance equal to or better than fixed extended thinking across programming, analysis, and creative tasks.

Tool Use, Reasoning, Vision (Image), File Input, Explicit Caching, Web Search

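import { streamText } from 'ai'

// Stream a response from Claude Opus 4.6 through AI Gateway.
const result = streamText({
  model: 'anthropic/claude-opus-4.6',
  prompt: 'Why is the sky blue?',
  providerOptions: {
    // Options forwarded to the Anthropic provider.
    anthropic: {
      speed: 'fast',
    },
    // Gateway routing: restrict the request to the Anthropic provider.
    gateway: {
      only: ['anthropic'],
    },
  },
})

// Print the text as it arrives.
for await (const textPart of result.textStream) {
  process.stdout.write(textPart)
}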


Frequently Asked Questions

When was Opus 4.6 made available on AI Gateway?

Opus 4.6 became available on AI Gateway on February 5, 2026.
