Qwen3 Max Preview
Qwen3 Max Preview is Alibaba's early-access release of its trillion-parameter Qwen3-Max model. It gives developers access to Qwen3-Max capabilities for evaluation and prototyping ahead of general availability (GA).
import { streamText } from 'ai'

const result = streamText({
  model: 'alibaba/qwen3-max-preview',
  prompt: 'Why is the sky blue?',
})

Frequently Asked Questions
What is the relationship between Qwen3 Max Preview and Qwen3-Max?
Qwen3 Max Preview provides early access to the same underlying trillion-parameter model. The preview designation signals access ahead of general availability (GA); capability and architecture are the same as the production release.
Are there rate limits specific to the preview version?
Preview models may be subject to capacity-based rate limits that differ from the GA release.
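One common way to cope with capacity-based limits is to retry with exponential backoff. The following is a minimal sketch, assuming the caller can tell which errors are worth retrying (for example, an HTTP 429 response). `backoffDelayMs` and `withRetry` are hypothetical helper names, not part of the AI SDK, and the delay values are arbitrary defaults.

```typescript
// Hypothetical helpers for illustration; not part of the AI SDK.

// Delay before the next retry: baseMs, 2*baseMs, 4*baseMs, ... capped at maxMs.
function backoffDelayMs(attempt: number, baseMs = 500, maxMs = 8000): number {
  return Math.min(maxMs, baseMs * Math.pow(2, attempt))
}

// Runs `call`, retrying on errors that `isRetryable` accepts,
// for at most `maxAttempts` total attempts.
async function withRetry<T>(
  call: () => Promise<T>,
  isRetryable: (err: unknown) => boolean,
  maxAttempts = 4,
  // Injectable so callers (and tests) can replace the real timer.
  sleep: (ms: number) => Promise<void> = (ms) =>
    new Promise<void>((resolve) => setTimeout(resolve, ms)),
): Promise<T> {
  for (let attempt = 0; ; attempt++) {
    try {
      return await call()
    } catch (err) {
      // Give up on the last attempt or on errors that should not be retried.
      if (attempt + 1 >= maxAttempts || !isRetryable(err)) throw err
      await sleep(backoffDelayMs(attempt))
    }
  }
}
```

Here `call` would wrap whatever request function your application uses, and `isRetryable` would inspect the error your client library raises. Both are left to the caller because the exact error shape depends on the client in use.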
How large is the context window on Qwen3 Max Preview?
262.1K tokens, matching the Qwen3-Max production release.
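A rough pre-flight check can flag prompts that are likely to exceed this window before a request is sent. The sketch below rests on two assumptions: the constant approximates the 262.1K figure above, and token counts are estimated at about four characters per token, which is a crude heuristic rather than the model's actual tokenizer. `estimateTokens` and `fitsContext` are hypothetical helper names.

```typescript
// Approximation of the 262.1K-token window quoted above.
const CONTEXT_WINDOW_TOKENS = 262100

// Assumption: roughly 4 characters per token. This is a heuristic,
// not the Qwen tokenizer, so treat the result as an estimate only.
function estimateTokens(text: string, charsPerToken = 4): number {
  return Math.ceil(text.length / charsPerToken)
}

// True when the estimated prompt size plus the tokens reserved for
// the model's output stays within the context window.
function fitsContext(prompt: string, reservedOutputTokens = 4096): boolean {
  return estimateTokens(prompt) + reservedOutputTokens <= CONTEXT_WINDOW_TOKENS
}
```

Because the estimate can be off in either direction, especially for code or non-English text, leave a comfortable margin rather than filling the window to the limit.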
Will my prompts built for the preview work with the GA model?
In most cases, yes, since the models share the same architecture and training. Run thorough regression tests before switching model identifiers, as minor behavioral changes can occur between preview and GA.
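A regression check of this kind can be sketched as a small harness that runs each prompt against both identifiers and reports cases that pass on the preview but fail on the candidate. This is an illustrative outline only: `findRegressions`, `Generate`, and `RegressionCase` are hypothetical names, the `generate` function is supplied by the caller, and the GA model identifier is not given here, so it is passed in as a parameter.

```typescript
// Caller-supplied function that sends `prompt` to `modelId` and returns the text.
type Generate = (modelId: string, prompt: string) => Promise<string>

// One test case: a prompt plus a task-specific check on the output.
interface RegressionCase {
  prompt: string
  passes: (output: string) => boolean
}

// Returns the prompts whose check passes on the preview model
// but fails on the candidate model.
async function findRegressions(
  generate: Generate,
  previewId: string,
  candidateId: string,
  cases: RegressionCase[],
): Promise<string[]> {
  const regressions: string[] = []
  for (const testCase of cases) {
    // Query both models for the same prompt in parallel.
    const [previewOut, candidateOut] = await Promise.all([
      generate(previewId, testCase.prompt),
      generate(candidateId, testCase.prompt),
    ])
    if (testCase.passes(previewOut) && !testCase.passes(candidateOut)) {
      regressions.push(testCase.prompt)
    }
  }
  return regressions
}
```

Using a per-case `passes` check instead of exact string comparison is deliberate, since model output can vary in wording between runs even when the answer is still correct.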
Does the preview support context caching?
Context caching availability depends on the serving provider; confirm support at your chosen provider before designing a caching strategy around repeated long prompts.
What coding and math benchmarks has the underlying model been evaluated on?
The underlying Qwen3-Max model scored 69.6% on SWE-bench Verified and 79.3% on LiveBench, with competitive results on AIME mathematical reasoning tasks.
Is it possible to access the model weights for Qwen3 Max Preview?
No. Qwen3-Max is a closed-weight model available only via API, both in preview and GA form.