Qwen 3 VL 235B A22B Instruct

Qwen 3 VL 235B A22B Instruct is Alibaba Cloud's 235B mixture-of-experts vision-language model with 22B active parameters per token, supporting interleaved text, images, and video over a context window of 262.1K tokens for visual coding, spatial perception, and fine-grained visual understanding.

Implicit Caching, Tool Use, Vision (Image)

index.ts

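import { streamText } from 'ai'

const result = streamText({
  model: 'alibaba/qwen3-vl-235b-a22b-instruct',
  prompt: 'Why is the sky blue?'
})

// Print the response as it streams in.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}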
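Since the model is listed with image vision support, a request will usually carry an image alongside the text. The following is a minimal sketch of how such a message could be assembled. The local types only mirror the text and image content parts of an AI SDK user message as commonly documented, `buildVisionMessages` is a hypothetical helper, and the image URL is a placeholder.

```typescript
// Local stand-ins for the content-part shapes of a user message (assumed shapes).
type TextPart = { type: 'text'; text: string }
type ImagePart = { type: 'image'; image: URL }
type UserMessage = { role: 'user'; content: Array<TextPart | ImagePart> }

// Hypothetical helper: one user message holding the question followed by each image.
function buildVisionMessages(question: string, imageUrls: string[]): UserMessage[] {
  const images = imageUrls.map((u): ImagePart => ({ type: 'image', image: new URL(u) }))
  return [{ role: 'user', content: [{ type: 'text', text: question }, ...images] }]
}

// Placeholder URL; replace with a real, publicly reachable image.
const messages = buildVisionMessages('What is in this picture?', [
  'https://example.com/photo.jpg'
])
```

Under these assumptions, the resulting array would be passed as `messages` in place of `prompt` in the `streamText` call.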


More models by Alibaba Cloud

Model	Context	Latency	Throughput	Input	Output	Cache Read	Cache Write	Release Date
alibaba/qwen3.7-plus		0.8s	348tps	$0.32/M	$1.28/M	$0.08/M	$0.5/M	06/02/2026
alibaba/qwen3.7-max	991K	2.9s	55tps	$1.25/M	$3.75/M	$0.25/M	$1.56/M	05/21/2026
alibaba/qwen3.6-plus		1.4s	109tps	$0.50/M	$3.00/M	$0.1/M	$0.63/M	04/02/2026
alibaba/qwen3.5-flash		1.0s	308tps	$0.10/M	$0.40/M	$0.0/M	$0.13/M	02/24/2026
alibaba/qwen3-embedding-8b	33K			$0.05/M				06/05/2025
alibaba/qwen-3-235b	262K	0.4s	88tps	$0.09/M	$0.10/M			04/28/2025
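The prices in the listing are quoted per million tokens, so a rough per-request cost is simple arithmetic. The sketch below uses the alibaba/qwen3.7-max figures from the listing. `estimateCostUSD` and its types are hypothetical, and it assumes cached input tokens are billed at the cache read rate instead of the input rate; cache write charges are ignored, and actual billing may differ.

```typescript
// Prices in USD per million tokens.
interface Pricing {
  inputPerM: number
  outputPerM: number
  cacheReadPerM?: number
}

interface Usage {
  inputTokens: number
  outputTokens: number
  cachedInputTokens?: number // portion of inputTokens served from cache
}

// Hypothetical helper. Assumption: cached input tokens use the cache read
// rate when one is listed, otherwise the normal input rate.
function estimateCostUSD(p: Pricing, u: Usage): number {
  const cached = u.cachedInputTokens ?? 0
  const uncached = u.inputTokens - cached
  const cachedRate = p.cacheReadPerM ?? p.inputPerM
  const total = uncached * p.inputPerM + cached * cachedRate + u.outputTokens * p.outputPerM
  return total / 1_000_000
}

// alibaba/qwen3.7-max prices as listed: $1.25/M input, $3.75/M output, $0.25/M cache read.
const qwenMax: Pricing = { inputPerM: 1.25, outputPerM: 3.75, cacheReadPerM: 0.25 }

// 200K input tokens and 10K output tokens, no cache hits: about $0.2875.
const uncachedCost = estimateCostUSD(qwenMax, { inputTokens: 200_000, outputTokens: 10_000 })

// Same request with half the input read from cache: about $0.1875.
const cachedCost = estimateCostUSD(qwenMax, {
  inputTokens: 200_000,
  outputTokens: 10_000,
  cachedInputTokens: 100_000
})
```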
