Qwen 3 Max Thinking

Qwen 3 Max Thinking is Alibaba Cloud's trillion-parameter reasoning model that autonomously deploys built-in search, memory, and code interpreter tools during inference, achieving a score of 49.8 on Humanity's Last Exam with search enabled. Your use subject to Alibaba Cloud's Terms & Privacy Policies.

ReasoningTool Use

Use with AI Gateway View docs

TypeScript

Python

import { streamText } from 'ai'

const result = streamText({
  model: 'alibaba/qwen3-max-thinking',
  prompt: 'Why is the sky blue?'
})

Read docs

Overview About Providers Throughput Latency Uptime Status Similar FAQ

More models by Alibaba Cloud

Model

Context	Latency	Throughput	Input	Output	Cache	Web Search	Capabilities	Providers	ZDR	No Training	Release Date

alibaba/qwen3.7-flash

991K

2.6s

115tps

$0.03/M

$0.13/M

Read:

$0.01/M

Write:

$0.04/M

—

07/28/2026

alibaba/qwen3.7-plus

2.9s

261tps

$0.32/M

$1.28/M

Read:$0.08/M

Write:$0.5/M

—

06/02/2026

alibaba/qwen3.7-max

991K

2.4s

55tps

$2.50/M

$7.50/M

Read:$0.5/M

Write:$3.13/M

—

05/21/2026

alibaba/qwen3.5-flash

2.5s

153tps

$0.10/M

$0.40/M

Read:$0.0/M

Write:$0.13/M

—

02/24/2026

alibaba/qwen3-embedding-0.6b

33K

$0.01/M

—

11/14/2025

alibaba/qwen3-embedding-4b

33K

$0.02/M

—

06/05/2025

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Qwen 3 Max Thinking

More models by Alibaba Cloud