
Qwen3 VL 235B A22B Instruct

Qwen3 VL 235B A22B Instruct is Alibaba's multimodal vision-language model supporting interleaved text, images, and video over a native context of 262.1K tokens, with architectural improvements in spatial-temporal modeling and agentic GUI interaction.

Tool Use, Vision (Image)
index.ts
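import { streamText } from 'ai'

const result = streamText({
  model: 'alibaba/qwen3-vl-instruct',
  prompt: 'Why is the sky blue?',
})
// Print the response as it streams in.
for await (const chunk of result.textStream) process.stdout.write(chunk)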

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
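As a rough illustration of setting a provider preference, the sketch below builds the options object for a `streamText` call. The slugs (`alibaba`, `novita`, `deepinfra`) and the `providerOptions.gateway.order` shape are assumptions made for this example, not values taken from this page; copy the real slugs from the table and check the docs for the exact option name.

```typescript
// Hypothetical provider slugs; copy the real ones from the Providers table.
type ProviderSlug = 'alibaba' | 'novita' | 'deepinfra'

// Builds the options for a streamText call with an ordered provider preference.
// Assumption: the gateway reads the preference list from
// providerOptions.gateway.order. Verify the option name in the docs.
function withProviderOrder(prompt: string, order: ProviderSlug[]) {
  return {
    model: 'alibaba/qwen3-vl-instruct',
    prompt,
    providerOptions: { gateway: { order } },
  }
}

const options = withProviderOrder('Why is the sky blue?', ['deepinfra', 'alibaba'])
console.log(JSON.stringify(options.providerOptions))
```

Under these assumptions the object would be passed straight through as `streamText(options)`. Keeping the preference in a small helper makes it easy to change the order in one place.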

| Provider  | Context | Latency | Throughput | Input   | Output  | Cache         | Release Date |
|-----------|---------|---------|------------|---------|---------|---------------|--------------|
| Alibaba   | 131K    | 0.5s    | 56tps      | $0.40/M | $1.60/M |               | 09/23/2025   |
| Novita AI | 131K    | 0.7s    | 46tps      | $0.30/M | $1.50/M |               | 09/23/2025   |
| DeepInfra | 262K    | 0.5s    | 24tps      | $0.20/M | $0.88/M | Read: $0.11/M | 09/23/2025   |
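To make the per-million-token prices concrete, here is a minimal sketch that estimates the cost of one request from the input and output prices listed above. The function and type names are illustrative, and the calculation ignores cache read pricing and any other charges, so treat the result as an approximation only.

```typescript
// Prices in USD per million tokens, copied from the provider listing above.
interface Pricing {
  inputPerM: number
  outputPerM: number
}

const pricing: Record<string, Pricing> = {
  'Alibaba': { inputPerM: 0.40, outputPerM: 1.60 },
  'Novita AI': { inputPerM: 0.30, outputPerM: 1.50 },
  'DeepInfra': { inputPerM: 0.20, outputPerM: 0.88 },
}

// Estimates the cost of one request; cache reads are not modeled.
function estimateCostUSD(provider: string, inputTokens: number, outputTokens: number): number {
  const p = pricing[provider]
  if (!p) throw new Error(`Unknown provider: ${provider}`)
  return (inputTokens / 1_000_000) * p.inputPerM + (outputTokens / 1_000_000) * p.outputPerM
}

// Example: 100,000 input tokens and 10,000 output tokens.
for (const name of Object.keys(pricing)) {
  console.log(`${name}: $${estimateCostUSD(name, 100_000, 10_000).toFixed(4)}`)
}
// Alibaba: $0.0560
// Novita AI: $0.0450
// DeepInfra: $0.0288
```

Price is only one axis: in the same listing, the cheapest provider also reports the lowest throughput, so the right choice depends on the workload.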