Qwen3 Next 80B A3B Thinking

Qwen3 Next 80B A3B Thinking is a hybrid Transformer-Mamba reasoning model that combines 80 billion total parameters (3B active per token) with a dedicated thinking mode, achieving strong results on AIME25 while supporting ultra-long contexts of 262.1K tokens.

ReasoningTool Use

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'alibaba/qwen3-next-80b-a3b-thinking',
  prompt: 'Why is the sky blue?'
})

Overview About Providers Throughput Latency Uptime Status Similar FAQ

More models by Alibaba Cloud

Model

Context	Latency	Throughput	Input	Output	Cache	Web Search	Capabilities	Providers	ZDR	No Training	Release Date

alibaba/qwen3.7-plus

2.6s

206tps

$0.32/M

$1.28/M

Read:$0.08/M

Write:$0.5/M

—

06/02/2026

alibaba/qwen3.7-max

991K

3.1s

55tps

$1.25/M

$3.75/M

Read:$0.25/M

Write:$1.56/M

—

05/21/2026

alibaba/qwen-3.6-max-preview

240K

2.1s

110tps

$1.30/M

$7.80/M

Read:

$0.26/M

Write:

$1.63/M

—

04/20/2026

alibaba/qwen3.6-plus

1.6s

110tps

$0.50/M

$3/M

Read:

$0.1/M

Write:

$0.63/M

—

04/02/2026

alibaba/qwen3.5-flash

1.1s

178tps

$0.10/M

$0.40/M

Read:$0.0/M

Write:$0.13/M

—

02/24/2026

alibaba/qwen3-embedding-0.6b

33K

$0.01/M

—

11/14/2025

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Qwen3 Next 80B A3B Thinking

More models by Alibaba Cloud