Qwen3 VL 235B A22B Instruct

Qwen3 VL 235B A22B Instruct is Alibaba Cloud's multimodal vision-language model supporting interleaved text, images, and video over a native context of 262.1K tokens, with architectural improvements in spatial-temporal modeling and agentic GUI interaction. Your use subject to Alibaba Cloud's Terms & Privacy Policies.

Tool UseVision (Image)

Use with AI Gateway View docs

TypeScript

Python

import { streamText } from 'ai'

const result = streamText({
  model: 'alibaba/qwen3-vl-instruct',
  prompt: 'Why is the sky blue?'
})

Read docs

Overview About Providers Throughput Latency Uptime Status Similar FAQ

More models by Alibaba Cloud

Model

Context	Latency	Throughput	Input	Output	Cache	Web Search	Capabilities	Providers	ZDR	No Training	Release Date

alibaba/qwen3.7-flash

991K

3.0s

216tps

$0.03/M+2 more

$0.13/M+2 more

Read:

$0.01/M+2 more

Write:

$0.04/M+2 more

—

07/28/2026

alibaba/qwen3.7-plus

1.2s

269tps

$0.32/M

$1.28/M

Read:$0.08/M

Write:$0.5/M

—

06/02/2026

alibaba/qwen3.7-max

991K

1.4s

57tps

$2.50/M

$7.50/M

Read:$0.5/M

Write:$3.13/M

—

05/21/2026

alibaba/qwen3.6-plus

1.3s

115tps

$0.50/M+1 more

$3/M+1 more

Read:

$0.1/M+1 more

Write:

$0.63/M+1 more

—

04/02/2026

alibaba/qwen3.5-flash

0.9s

282tps

$0.10/M

$0.40/M

Read:$0.0/M

Write:$0.13/M

—

02/24/2026

alibaba/qwen3-embedding-8b

33K

$0.05/M

—

06/05/2025

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Qwen3 VL 235B A22B Instruct

More models by Alibaba Cloud