Skip to content

Qwen 3 VL 235B A22B Instruct

Qwen 3 VL 235B A22B Instruct is Alibaba's 235B mixture-of-experts vision-language model with 22B active parameters per token, supporting interleaved text, images, and video over a context window of 262.1K tokens for visual coding, spatial perception, and fine-grained visual understanding.

ReasoningTool UseImplicit CachingFile InputVision (Image)
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'alibaba/qwen3-vl-235b-a22b-instruct',
prompt: 'Why is the sky blue?'
})

Frequently Asked Questions

  • What modalities does Qwen 3 VL 235B A22B Instruct accept?

    The model accepts interleaved text, images, and video frames within a single context window of up to 262.1K tokens, with output up to 262.1K tokens tokens per request.

  • How is Qwen 3 VL 235B A22B Instruct different from Qwen3-VL-Thinking?

    The instruct variant produces direct, structured answers and is generally faster and cheaper to run. Qwen 3 VL 235B A22B Instruct is tuned for instruction following without extended reasoning traces, while Qwen3-VL-Thinking emits a visible chain-of-thought before its final response and is better suited to complex visual STEM and compositional reasoning.

  • What does the 235B-A22B notation mean?

    Qwen 3 VL 235B A22B Instruct is a mixture-of-experts model with 235 billion total parameters, of which approximately 22 billion activate per token. This design keeps the active compute close to a 22B dense model while preserving the capability profile of a much larger network.

  • Which API surfaces work with Qwen 3 VL 235B A22B Instruct on AI Gateway?

    You can call Qwen 3 VL 235B A22B Instruct through AI SDK, Chat Completions API, Responses API, Messages API, or other API formats, from TypeScript or Python. Reference `alibaba/qwen3-vl-235b-a22b-instruct` as the model identifier in your request.

  • What is the context window for Qwen 3 VL 235B A22B Instruct?

    The context window is 262.1K tokens, which applies to the combined sequence of text tokens and visual tokens (image patches and video frames encoded as tokens) in an interleaved request.

  • Does Qwen 3 VL 235B A22B Instruct support zero data retention?

    Yes, Zero Data Retention is available for this model. Zero Data Retention is offered on a per-provider basis. See https://vercel.com/docs/ai-gateway/capabilities/zdr for details.

  • Where can I see live latency and cost data for Qwen 3 VL 235B A22B Instruct?

    This page shows live throughput, time-to-first-token, and pricing metrics for Qwen 3 VL 235B A22B Instruct measured across real AI Gateway traffic.