Skip to content
Dashboard

Qwen 3 VL 235B A22B Instruct

Qwen 3 VL 235B A22B Instruct is Alibaba's 235B mixture-of-experts vision-language model with 22B active parameters per token, supporting interleaved text, images, and video over a context window of 262.1K tokens for visual coding, spatial perception, and fine-grained visual understanding.

ReasoningTool UseImplicit CachingFile InputVision (Image)
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'alibaba/qwen3-vl-235b-a22b-instruct',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
DeepInfra
262K
0.6s
24tps
$0.20/M$0.88/M
Read:$0.11/M
Write:
+3
Alibaba
131K
0.5s
52tps
$0.40/M$1.60/M