
Qwen 3 32B

Qwen 3 32B is a dense 32-billion-parameter model from Alibaba with a 131.1K-token context window and hybrid thinking modes, reaching performance levels previously associated with much larger models.

Reasoning · Tool Use
index.ts
import { streamText } from 'ai'

const result = streamText({
  model: 'alibaba/qwen-3-32b',
  prompt: 'Why is the sky blue?',
})

// consume the response as tokens arrive
for await (const text of result.textStream) {
  process.stdout.write(text)
}

Playground

Try out Qwen 3 32B by Alibaba. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

About Qwen 3 32B

Qwen 3 32B is a fully dense model with no expert routing or sparse activation. All 32 billion parameters participate in generating each token. This architecture has a predictable operational profile: memory requirements are fixed, throughput is predictable, and there's no MoE infrastructure complexity to manage.
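Because every parameter is active for every token, a back-of-envelope compute comparison against a MoE sibling is straightforward. The sketch below uses the rough ~2 FLOPs-per-active-parameter-per-token rule of thumb, which is an approximation; the 3B-active figure is Qwen3-30B-A3B's, discussed later on this page.

```typescript
// Back-of-envelope: forward-pass compute per generated token scales with
// ACTIVE parameters, roughly 2 FLOPs per active parameter per token.
const flopsPerToken = (activeParams: number): number => 2 * activeParams

const denseQwen32B = flopsPerToken(32e9) // dense: all 32B parameters active
const moeQwen30B = flopsPerToken(3e9)    // Qwen3-30B-A3B: ~3B active per token

// Dense 32B does roughly 10.7x the per-token compute of the 3B-active MoE,
// in exchange for a simpler, fixed-footprint serving profile.
const ratio = denseQwen32B / moeQwen30B
```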

Alibaba positions Qwen 3 32B as reaching capability levels that Qwen2.5 needed 72 billion parameters to achieve, a meaningful efficiency gain attributed to third-generation architecture refinements across the model's 64 transformer layers.

Hybrid thinking mode is available here as in the rest of the Qwen3 family. Activating thinking mode enables Qwen 3 32B to reason step-by-step before producing its answer, improving quality on problems requiring multi-step logic or structured derivation. Non-thinking mode bypasses the reasoning trace for applications where response speed takes priority. The budget control mechanism lets you set a token ceiling on the thinking phase, giving fine-grained control over the latency-quality tradeoff per request.
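One way to exercise that budget control is to pick a thinking-token ceiling per request before making the call. The tiers below are illustrative assumptions, not official guidance, and the provider-option key for passing the budget varies by gateway, so check the documentation for the actual field name.

```typescript
// Hedged sketch: choose a thinking-phase token ceiling per request.
// 0 means non-thinking mode; larger ceilings trade latency for quality.
type TaskKind = 'chat' | 'analysis' | 'derivation'

function thinkingBudget(kind: TaskKind): number {
  switch (kind) {
    case 'chat':
      return 0 // latency-sensitive: skip the reasoning trace entirely
    case 'analysis':
      return 1024 // modest ceiling for structured answers
    case 'derivation':
      return 4096 // room for multi-step logic
  }
}
```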

The model supports tool calling, agentic task scenarios, and MCP. The context window of 131.1K tokens accommodates long documents, multi-turn conversations, and retrieval-augmented generation (RAG) patterns where large amounts of source material need to fit in a single context.
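When packing source material for RAG against the 131.1K-token window, a rough budget check helps avoid overflow. This sketch uses the common ~4-characters-per-token heuristic, which is only an approximation; use a real tokenizer for production accounting.

```typescript
// Rough token budgeting against the 131.1K-token context window.
const CONTEXT_WINDOW = 131_100

// ~4 characters per token is a crude English-text heuristic, not exact.
const estimateTokens = (text: string): number => Math.ceil(text.length / 4)

// Check that all source docs plus a reserved output budget fit the window.
function fitsInContext(docs: string[], reserveForOutput = 4096): boolean {
  const used = docs.reduce((sum, d) => sum + estimateTokens(d), 0)
  return used + reserveForOutput <= CONTEXT_WINDOW
}
```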

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

| Provider | Context | Latency | Throughput | Input | Output | Cache (Read / Write) | Release Date |
|---|---|---|---|---|---|---|---|
| Amazon Bedrock | 128K | 0.3s | 204 tps | $0.15/M | $0.60/M | — | 04/01/2025 |
| Alibaba | 128K | 0.9s | 72 tps | $0.16/M | $0.64/M | — | 04/01/2025 |
| DeepInfra | 41K | 0.3s | 43 tps | $0.10/M | $0.30/M | — | 04/01/2025 |
| Groq | 131K | 0.2s | 308 tps | $0.29/M | $0.59/M | Read: $0.14/M / Write: — | 04/01/2025 |
Throughput

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.

Latency

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.

Uptime

Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.

More models by Alibaba

| Model | Context | Latency | Throughput | Input | Output | Cache (Read / Write) | Providers | Release Date |
|---|---|---|---|---|---|---|---|---|
| — | 240K | 1.7s | 82 tps | $1.30/M | $7.80/M | $0.26/M / $1.63/M | Alibaba | 04/20/2026 |
| — | 1M | 1.0s | 55 tps | $0.50/M | $3.00/M | $0.10/M / $0.63/M | Alibaba, Fireworks | 04/02/2026 |
| — | 1M | 1.1s | 284 tps | $0.10/M | $0.40/M | $0.00/M / $0.13/M | Alibaba | 02/24/2026 |
| — | 1M | 2.3s | 55 tps | $0.40/M | $2.40/M | $0.04/M / $0.50/M | Alibaba | 02/16/2026 |
| — | 256K | 0.2s | 143 tps | $0.50/M | $1.20/M | — | Bedrock, Together AI | 07/22/2025 |
| — | 33K | — | — | $0.02/M | — | — | DeepInfra | 06/05/2025 |

What To Consider When Choosing a Provider

  • Configuration: If your organization has compliance requirements tied to specific cloud infrastructure, reviewing the provider list and their data handling commitments is worthwhile before deploying at scale.
  • Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use Qwen 3 32B

Best For

  • Long-document processing and analysis: The context window of 131.1K tokens, combined with dense 32B capacity, handles tasks like full-document summarization, cross-document comparison, and extended conversation history without chunking
  • Complex instruction following: Dense models at this parameter scale reliably handle nuanced, multi-constraint instructions. Tasks that require careful attention to several simultaneous requirements (format, tone, content constraints, citation style) are well-served here
  • Agentic workflows requiring sustained coherence: The window of 131.1K tokens helps Qwen 3 32B maintain context across extended multi-step interactions without losing track of earlier steps or decisions
  • Coding tasks and technical writing: Strong benchmark performance in coding, combined with a context window large enough to hold substantial codebases or specifications, makes Qwen 3 32B useful for technical assistance workflows
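The sustained-coherence point above reduces to a simple host-side loop: the model either requests a tool call or returns a final answer, and each tool result is appended back into context for the next turn. The sketch below mocks the model so the control flow is runnable; the names and shapes are illustrative, not the AI SDK's actual tool-calling API.

```typescript
// Hedged sketch of the host-side loop behind an agentic workflow.
type ModelTurn =
  | { type: 'tool_call'; name: 'add'; args: { a: number; b: number } }
  | { type: 'final'; text: string }

// Registry of host-executed tools (stubbed for illustration).
const tools = {
  add: ({ a, b }: { a: number; b: number }): number => a + b,
}

// Mocked model: first requests a tool, then answers using the result.
function mockModel(history: string[]): ModelTurn {
  if (history.length === 0) {
    return { type: 'tool_call', name: 'add', args: { a: 2, b: 3 } }
  }
  return { type: 'final', text: `The sum is ${history[history.length - 1]}` }
}

function runAgent(): string {
  const history: string[] = []
  for (;;) {
    const turn = mockModel(history)
    if (turn.type === 'final') return turn.text
    const result = tools[turn.name](turn.args)
    history.push(String(result)) // tool result goes back into context
  }
}
```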

Consider Alternatives When

  • Serving cost at high volume dominates: The Qwen3-30B-A3B MoE activates only 3B parameters per inference, which can be substantially cheaper to serve for equivalent throughput. If cost efficiency dominates, the MoE variant is worth evaluating
  • You need a higher quality ceiling: The Qwen3-235B-A22B MoE reaches higher benchmark performance on the hardest tasks, making it a better fit where capability headroom outweighs per-token cost
  • Tasks are simple and short: For basic question-answering, short-form classification, or simple text formatting, the smaller Qwen3-14B will provide adequate quality at lower cost per token

Conclusion

Qwen 3 32B delivers strong dense-model performance in the Qwen3 family, reaching capability benchmarks that required a 72B-parameter model in the previous generation. It's a solid choice for long-context tasks, complex instruction following, and teams that want a simple dense-model deployment without MoE infrastructure considerations. AI Gateway's provider pool gives it reliable availability through Amazon Bedrock, Alibaba, DeepInfra, and Groq with a single integration.

Frequently Asked Questions

  • What does it mean that Qwen 3 32B is a "dense" model versus the MoE variants?

    In a dense model, all parameters are used to process every token. In a mixture-of-experts model, only a fraction of parameters activate per token. Qwen 3 32B uses all 32 billion parameters for each inference, while Qwen3-30B-A3B (for example) activates only 3 billion of its 30 billion. Dense models have simpler serving infrastructure at the cost of higher per-token compute.

  • How much better is Qwen 3 32B compared to Qwen2.5-32B?

    Alibaba positions Qwen 3 32B as equivalent in capability to Qwen2.5-72B-Base, approximately a generation of headroom at the same parameter count.

  • What is the maximum context length and how does it affect pricing?

    This page lists the current rates. Multiple providers can serve Qwen 3 32B, so AI Gateway surfaces live pricing rather than a single fixed figure.

  • How does the thinking mode interact with the context window?

    Thinking mode produces an internal reasoning trace that counts toward the total token budget. Long thinking traces in complex problems can consume a meaningful portion of the context window. Setting an appropriate thinking budget helps ensure the trace doesn't crowd out the content you need in context.

  • Can Qwen 3 32B handle multi-turn conversations reliably across long sessions?

    Yes. With a context window of 131.1K tokens, the model maintains extended conversation history without truncation for most use cases. Sessions that exceed the window will require context management strategies like summarizing earlier turns.

  • What tool-calling capabilities does Qwen 3 32B have?

    Qwen 3 32B supports tool calling and MCP (Model Context Protocol). It can select, invoke, and chain tool calls across multi-step workflows. The Qwen-Agent framework provides additional scaffolding for complex agentic applications.

  • Under what license is Qwen 3 32B released?

    The dense Qwen3 models including Qwen 3 32B are released under the Apache 2.0 license.