1 min read
Fast mode for Claude Opus 4.7 is now available on AI Gateway in research preview.
Fast mode delivers ~2.5x faster output token generation with full Opus 4.7 intelligence. This is an early, experimental feature.
To enable fast mode, pass speed: 'fast' in the anthropic provider options with anthropic/claude-opus-4.7.
import { streamText } from "ai";
const { text } = await streamText({ model: "anthropic/claude-opus-4.7", prompt: "Analyze this codebase structure and create a plan to add user auth.", providerOptions: { anthropic: { speed: "fast", }, },});You can use fast mode with Claude Code via AI Gateway by setting the CLAUDE_CODE_SKIP_FAST_MODE_ORG_CHECK and CLAUDE_CODE_ENABLE_OPUS_4_7_FAST_MODE variables in your shell configuration file or in ~/.claude/settings.json.
export CLAUDE_CODE_ENABLE_OPUS_4_7_FAST_MODE=1export CLAUDE_CODE_SKIP_FAST_MODE_ORG_CHECK=1{ "env": { "CLAUDE_CODE_SKIP_FAST_MODE_ORG_CHECK": "1", "CLAUDE_CODE_ENABLE_OPUS_4_7_FAST_MODE": "1" }}Fast mode is priced at 6x standard Opus rates.
All standard pricing multipliers (e.g., prompt caching) apply on top of these rates.
AI Gateway: Track top AI models by usage
The AI Gateway model leaderboard ranks the most used models over time by total token volume across all traffic through the Gateway. Updates regularly.
View the leaderboard