Qwen3 VL 235B A22B Instruct

Qwen3 VL 235B A22B Instruct is Alibaba Cloud's multimodal vision-language model supporting interleaved text, images, and video over a native context of 262.1K tokens, with architectural improvements in spatial-temporal modeling and agentic GUI interaction.

Tool Use, Vision (Image)

index.ts

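import { streamText } from 'ai'

const result = streamText({
  model: 'alibaba/qwen3-vl-instruct',
  prompt: 'Why is the sky blue?'
})

// Print the response as it streams in.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}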
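
Because the model accepts images as well as text, a request can carry both in one user message. The following is a minimal sketch, not an official example: `buildImageQuestion` is a hypothetical helper, the local types are simplified stand-ins for the AI SDK's message types, and the image URL is a placeholder.

```typescript
// Hypothetical helper: builds one user message that mixes text and an image.
// The part shapes ({ type: 'text' }, { type: 'image' }) mirror the AI SDK's
// multimodal message format; these local types are simplified assumptions.
type TextPart = { type: 'text'; text: string }
type ImagePart = { type: 'image'; image: URL }
type UserMessage = { role: 'user'; content: Array<TextPart | ImagePart> }

function buildImageQuestion(question: string, imageUrl: string): UserMessage[] {
  return [
    {
      role: 'user',
      content: [
        { type: 'text', text: question },
        { type: 'image', image: new URL(imageUrl) }
      ]
    }
  ]
}

// Placeholder URL; replace with a real, publicly reachable image.
const messages = buildImageQuestion(
  'What is shown in this image?',
  'https://example.com/photo.jpg'
)
```

The resulting array would be passed to `streamText` as `messages` in place of `prompt`, with the same `model` value as above.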


Playground

Try out Qwen3 VL 235B A22B Instruct by Alibaba Cloud. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.
