Qwen3 235B A22B is the largest MoE model in the Qwen3 family. Its MoE design routes each token through 8 of 128 experts, activating only 22 billion of the full 235 billion parameters. This sparsity keeps serving costs proportional to the activated parameter count, while the full parameter space retains the breadth of knowledge needed for performance competitive with large proprietary models.
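To make the routing concrete, here is a toy sketch of top-k expert selection in PyTorch. The hidden size and the plain linear "experts" are illustrative assumptions, not the model's actual architecture; only the 8-of-128 routing pattern mirrors the description above.

```python
import torch
import torch.nn.functional as F

# Toy top-k MoE layer: each token is routed to k of n_experts experts,
# so only a small fraction of the expert parameters runs per token.
n_experts, k, d_model = 128, 8, 16  # d_model is a toy size, not the real one

router = torch.nn.Linear(d_model, n_experts)  # gating network scores experts per token
experts = torch.nn.ModuleList(
    [torch.nn.Linear(d_model, d_model) for _ in range(n_experts)]
)

def moe_forward(x: torch.Tensor) -> torch.Tensor:
    # x: (tokens, d_model)
    logits = router(x)                              # (tokens, n_experts)
    weights, idx = torch.topk(logits, k, dim=-1)    # pick the k best experts per token
    weights = F.softmax(weights, dim=-1)            # normalize the selected scores
    out = torch.zeros_like(x)
    for t in range(x.shape[0]):                     # per-token dispatch (illustrative, not efficient)
        for w, e in zip(weights[t], idx[t]):
            out[t] += w * experts[int(e)](x[t])
    return out

print(moe_forward(torch.randn(4, d_model)).shape)   # torch.Size([4, 16])
```

Only the 8 selected experts contribute to each token's output, which is why the per-token compute tracks the activated parameter count rather than the full 235 billion.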
The model covers 119 languages and dialects.
On benchmark evaluations, Qwen3 235B A22B achieves results competitive with other strong reasoning models across coding, mathematics, and general capability assessments. Like all Qwen3 models, it supports a hybrid reasoning system: thinking mode activates extended computation for step-by-step problem solving, while non-thinking mode produces immediate responses when latency matters more than deliberation. You can configure a thinking budget per request to tune the cost-quality tradeoff dynamically.
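A minimal sketch of switching between the two modes with the Hugging Face Transformers chat-template interface; the `enable_thinking` flag follows the usage shown in the Qwen3 model card, while the prompt text and generation settings are placeholders.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "Qwen/Qwen3-235B-A22B"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto", device_map="auto")

messages = [{"role": "user", "content": "How many primes are there below 100?"}]

# enable_thinking=True renders the template so the model reasons step by step;
# set it to False for an immediate, low-latency answer.
text = tokenizer.apply_chat_template(
    messages,
    tokenize=False,
    add_generation_prompt=True,
    enable_thinking=True,
)

inputs = tokenizer([text], return_tensors="pt").to(model.device)
output_ids = model.generate(**inputs, max_new_tokens=1024)
print(tokenizer.decode(output_ids[0][inputs.input_ids.shape[-1]:], skip_special_tokens=True))
```

How a per-request thinking budget is expressed depends on the serving stack; the flag above only toggles the mode itself.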
The model supports tool calling and Model Context Protocol (MCP), making it suitable for multi-step workflows where the model orchestrates external tools or APIs. Alibaba recommends pairing it with the Qwen-Agent framework for complex agentic pipelines.
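A minimal agentic sketch using the Qwen-Agent `Assistant` interface, in the style of the framework's published examples. The local endpoint URL, API key, MCP server entry, and user prompt are placeholder assumptions; adapt them to your own deployment.

```python
from qwen_agent.agents import Assistant

# Points at a local OpenAI-compatible endpoint serving the model; URL and key are placeholders.
llm_cfg = {
    "model": "Qwen3-235B-A22B",
    "model_server": "http://localhost:8000/v1",
    "api_key": "EMPTY",
}

# Tool list mixing an MCP server with a built-in tool; the MCP entry is an example configuration.
tools = [
    {"mcpServers": {
        "time": {"command": "uvx", "args": ["mcp-server-time"]},
    }},
    "code_interpreter",
]

bot = Assistant(llm=llm_cfg, function_list=tools)

messages = [{"role": "user", "content": "What time is it, and what is 2**20?"}]
responses = []
for responses in bot.run(messages=messages):
    pass  # bot.run streams intermediate states; keep the final one
print(responses)
```

The agent decides per turn whether to call a tool, reads the result back into context, and continues until it can answer, which is the multi-step orchestration pattern described above.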