Qwen3 Max

alibaba/qwen3-max

Compared with the preview version, the Qwen3 Max model has received targeted upgrades in agentic coding and tool calling. The official release is reported to achieve state-of-the-art (SOTA) performance in its field and is better suited to agents operating in complex scenarios.

Tool Use • Implicit Caching

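import { streamText } from 'ai'

// A plain string model ID is routed through the AI Gateway.
const result = streamText({
  model: 'alibaba/qwen3-max',
  prompt: 'Why is the sky blue?'
})

// Print the response text as it streams in.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}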

Playground

Try out Qwen3 Max by Alibaba Cloud. Usage is billed to your team at API rates. Free users get $5 of credits every 30 days, and you are considered a free user if you haven't made a payment.


Providers

The AI Gateway can route requests across multiple AI providers. You can control provider preferences using each provider's slug. For more, see the AI Gateway provider options documentation. By using an AI provider, you acknowledge that you have reviewed and agree to its terms, listed in the Legal section under the provider's name.
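As a rough sketch of how a provider preference might be expressed in code, the helper below builds an ordered slug list. The helper name `gatewayPreference` and the slugs `provider-a` and `provider-b` are hypothetical placeholders, and the `gateway.order` shape is an assumption that should be checked against the AI Gateway provider options documentation.

```typescript
// Hypothetical helper: builds a provider-preference object for the AI Gateway.
// Assumption: the gateway reads an ordered slug list from providerOptions.gateway.order.
type GatewayPreference = { gateway: { order: string[] } }

function gatewayPreference(slugs: string[]): GatewayPreference {
  // Drop blanks and duplicates while keeping the caller's priority order.
  const seen = new Set<string>()
  const order: string[] = []
  for (const raw of slugs) {
    const slug = raw.trim()
    if (slug !== '' && !seen.has(slug)) {
      seen.add(slug)
      order.push(slug)
    }
  }
  return { gateway: { order } }
}

// Placeholder slugs; replace them with the real provider slugs for this model.
const providerOptions = gatewayPreference(['provider-a', 'provider-b', 'provider-a'])
console.log(JSON.stringify(providerOptions))
```

Under the same assumption, the resulting object would be passed as `providerOptions` next to `model` and `prompt` in the `streamText` call.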

Provider	Context	Max Output	Latency	Throughput	Input	Output	Cache	Image Gen	Video Gen	Web Search	Capabilities	ZDR	Release Date
Legal: Terms • Privacy	262K	33K	2.0s	33tps	$1.20/M	$6.00/M	Read: $0.24/M, Write: —						09/23/2025
Legal: Terms • Privacy	262K	66K	1.2s	31tps	$0.84/M	$3.38/M	—						09/23/2025
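The per-million-token rates for the two providers can be turned into a per-request estimate. The sketch below is illustrative only: the names `Rates`, `estimateCostUSD`, and `estimateSeconds` are hypothetical, it assumes cached input tokens are billed at the cache-read rate in place of the input rate, and it assumes the Latency column means time to first token.

```typescript
// Rates in USD per million tokens, copied from the two provider rows.
interface Rates {
  inputPerM: number
  outputPerM: number
  cacheReadPerM?: number // only listed for the first provider
}

const firstProvider: Rates = { inputPerM: 1.2, outputPerM: 6.0, cacheReadPerM: 0.24 }
const secondProvider: Rates = { inputPerM: 0.84, outputPerM: 3.38 }

// Assumption: cached input tokens are billed at the cache-read rate; a provider
// without a listed cache-read rate bills them as ordinary input tokens.
function estimateCostUSD(
  rates: Rates,
  inputTokens: number,
  outputTokens: number,
  cachedTokens = 0
): number {
  const cachedRate = rates.cacheReadPerM ?? rates.inputPerM
  const freshTokens = inputTokens - cachedTokens
  const total =
    freshTokens * rates.inputPerM +
    cachedTokens * cachedRate +
    outputTokens * rates.outputPerM
  return total / 1_000_000
}

// Rough wall-clock estimate: latency plus the time to stream the output.
function estimateSeconds(latencySec: number, tokensPerSec: number, outputTokens: number): number {
  return latencySec + outputTokens / tokensPerSec
}

// Example request: 100,000 input tokens and 10,000 output tokens.
console.log(estimateCostUSD(firstProvider, 100_000, 10_000).toFixed(4))
console.log(estimateCostUSD(secondProvider, 100_000, 10_000).toFixed(4))
// Same request with 80,000 of the input tokens served from cache.
console.log(estimateCostUSD(firstProvider, 100_000, 10_000, 80_000).toFixed(4))
// Estimated time to stream 1,000 output tokens from each provider.
console.log(estimateSeconds(2.0, 33, 1_000).toFixed(1))
console.log(estimateSeconds(1.2, 31, 1_000).toFixed(1))
```

These figures are estimates from the listed rates and measured averages, so actual bills and response times will vary.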


More models by Alibaba Cloud

Model	Context	Latency	Throughput	Input	Output	Capabilities	Providers	ZDR	Release Date

alibaba/qwen-3-14b	41K	0.6s	80tps	$0.06/M	$0.24/M		04/01/2025
alibaba/qwen-3-235b	131K	0.4s	32tps	$0.07/M	$0.46/M		04/01/2025
alibaba/qwen-3-30b	41K	0.3s	54tps	$0.08/M	$0.29/M		04/01/2025
alibaba/qwen-3-32b	131K	0.3s	305tps	$0.10/M	$0.30/M	+1	04/01/2025
alibaba/qwen3-235b-a22b-thinking	262K	0.5s	76tps	$0.30/M	$2.90/M		04/01/2025
alibaba/qwen3-coder	262K	0.6s	91tps	$0.40/M	$1.60/M	+1	04/01/2025
alibaba/qwen3-coder-30b-a3b	262K	1.3s	131tps	$0.15/M	$0.60/M		04/01/2025
alibaba/qwen3-coder-next	256K	1.3s	51tps	$0.50/M	$1.20/M		07/22/2025
alibaba/qwen3-coder-plus	1M	1.2s	50tps	$1.00/M	$5.00/M		07/23/2025
alibaba/qwen3-embedding-0.6b	33K			$0.01/M			11/14/2025
alibaba/qwen3-embedding-4b	33K			$0.02/M			06/05/2025
alibaba/qwen3-embedding-8b	33K			$0.05/M			06/05/2025
alibaba/qwen3-max-preview	262K	1.9s	39tps	$1.20/M	$6.00/M		09/23/2025
alibaba/qwen3-max-thinking	256K	1.5s	33tps	$1.20/M	$6.00/M
alibaba/qwen3-next-80b-a3b-instruct	262K	0.4s	135tps	$0.09/M	$1.10/M		09/12/2025
alibaba/qwen3-next-80b-a3b-thinking	131K	1.0s	312tps	$0.15/M	$1.50/M		09/12/2025
alibaba/qwen3-vl-instruct	262K	0.8s	54tps	$0.20/M	$1.20/M	+1	09/24/2025
alibaba/qwen3-vl-thinking	256K	0.8s	85tps	$0.22/M	$0.88/M		09/24/2025
alibaba/qwen3.5-flash	1M	1.7s	178tps	$0.10/M	$0.40/M		02/24/2026
alibaba/qwen3.5-plus	1M	2.7s	64tps	$0.40/M	$2.40/M		02/16/2026
alibaba/wan-v2.5-t2v-preview							09/24/2025
alibaba/wan-v2.6-i2v							12/16/2025
alibaba/wan-v2.6-i2v-flash							12/16/2025
alibaba/wan-v2.6-r2v							12/16/2025
alibaba/wan-v2.6-r2v-flash							12/16/2025
alibaba/wan-v2.6-t2v							12/16/2025
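To compare the listed models for a given workload, the table can be filtered by context window and ranked by estimated cost. The sketch below uses a handful of rows from the table; `ModelRow` and `cheapestFor` are hypothetical names, and the context sizes are the table's rounded figures rather than exact token limits.

```typescript
interface ModelRow {
  id: string
  contextTokens: number // rounded from the table's K/M figures
  inputPerM: number     // USD per million input tokens
  outputPerM: number    // USD per million output tokens
}

// A few rows copied from the model table.
const models: ModelRow[] = [
  { id: 'alibaba/qwen-3-14b', contextTokens: 41_000, inputPerM: 0.06, outputPerM: 0.24 },
  { id: 'alibaba/qwen-3-32b', contextTokens: 131_000, inputPerM: 0.1, outputPerM: 0.3 },
  { id: 'alibaba/qwen3-coder', contextTokens: 262_000, inputPerM: 0.4, outputPerM: 1.6 },
  { id: 'alibaba/qwen3-max-preview', contextTokens: 262_000, inputPerM: 1.2, outputPerM: 6.0 },
  { id: 'alibaba/qwen3.5-flash', contextTokens: 1_000_000, inputPerM: 0.1, outputPerM: 0.4 },
]

// Returns the lowest-cost model whose context window fits the request,
// or undefined when no listed model is large enough.
function cheapestFor(
  rows: ModelRow[],
  minContext: number,
  inputTokens: number,
  outputTokens: number
): ModelRow | undefined {
  const cost = (m: ModelRow): number =>
    (inputTokens * m.inputPerM + outputTokens * m.outputPerM) / 1_000_000
  return rows
    .filter((m) => m.contextTokens >= minContext) // filter() copies, so rows is not mutated
    .sort((a, b) => cost(a) - cost(b))[0]
}

// A long-context request and a short one.
console.log(cheapestFor(models, 200_000, 150_000, 5_000)?.id)
console.log(cheapestFor(models, 30_000, 20_000, 2_000)?.id)
```

Price is only one axis; latency, throughput, and capabilities from the same table may matter more for a particular application.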

