Skip to content

Qwen 3 Max Thinking

Qwen 3 Max Thinking is Alibaba's trillion-parameter reasoning model that autonomously deploys built-in search, memory, and code interpreter tools during inference, achieving a score of 49.8 on Humanity's Last Exam with search enabled.

ReasoningTool UseImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'alibaba/qwen3-max-thinking',
prompt: 'Why is the sky blue?'
})
Throughput

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.