Skip to content

Qwen3 Next 80B A3B Thinking

Qwen3 Next 80B A3B Thinking is a hybrid Transformer-Mamba reasoning model that combines 80 billion total parameters (3B active per token) with a dedicated thinking mode, achieving strong results on AIME25 while supporting ultra-long contexts of 262.1K tokens.

index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'alibaba/qwen3-next-80b-a3b-thinking',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
Alibaba
131K
0.5s
280tps
$0.15/M$1.20/M
09/12/2025
Novita AI
66K
0.9s
428tps
$0.15/M$1.50/M
09/12/2025
Google Vertex AI
262K
0.3s
118tps
$0.15/M$1.20/M
09/12/2025