Qwen 3 Max Thinking

Qwen 3 Max Thinking is Alibaba Cloud's trillion-parameter reasoning model that autonomously deploys built-in search, memory, and code interpreter tools during inference, achieving a score of 49.8 on Humanity's Last Exam with search enabled. Your use subject to Alibaba Cloud's Terms & Privacy Policies.

ReasoningTool Use

Use with AI Gateway View docs

TypeScript

Python

import { streamText } from 'ai'

const result = streamText({
  model: 'alibaba/qwen3-max-thinking',
  prompt: 'Why is the sky blue?'
})

Read docs

Overview About Providers Throughput Latency Uptime Status Similar FAQ

About Qwen 3 Max Thinking

Qwen 3 Max Thinking, released on January 23, 2026, extends the Qwen3-Max architecture with a dedicated extended reasoning mode and integrated autonomous tool use. When Qwen 3 Max Thinking encounters a question that exceeds its internal knowledge or requires computation, it independently decides whether to trigger its Search tool (for current information), Memory tool (for cross-turn context persistence), or Code Interpreter (for numerical verification and data processing), without you needing to specify which tool applies.

This autonomous tool selection is a meaningful architectural distinction. Rather than exposing tool invocation as an explicit user-facing control, Qwen 3 Max Thinking treats it as an internal reasoning step, making the interaction feel more like working with a capable assistant that knows when to check its work. The design is intended to reduce hallucination risk on factual queries by defaulting to retrieval when confidence is low, and to improve numerical accuracy by routing computations through an interpreter.

Qwen 3 Max Thinking's thinking mode exposes its reasoning chain before delivering a final answer, providing transparency into multi-step problem decomposition. On Humanity's Last Exam, a benchmark of approximately 3,000 graduate-level questions spanning mathematics, science, and engineering, Qwen 3 Max Thinking with search enabled scored 49.8, competitive with other models on the same benchmark in Alibaba Cloud's published comparisons. You can access Qwen 3 Max Thinking through AI SDK, Chat Completions API, Responses API, Messages API, or other API formats, from TypeScript or Python.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Qwen 3 Max Thinking

About Qwen 3 Max Thinking