Skip to content

Qwen 3.7 Max

Qwen 3.7 Max is Alibaba's flagship agent-tuned model in the Qwen 3.7 line, with a context window of 991K tokens and an emphasis on long-horizon tool use, multi-file coding, and office workflow automation.

ReasoningTool UseVision (Image)File InputImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'alibaba/qwen3.7-max',
prompt: 'Why is the sky blue?'
})

Frequently Asked Questions

  • What is the relationship between Qwen 3.7 Max and Qwen3.6-Max-Preview?

    Qwen 3.7 Max is the Max-tier flagship in the Qwen 3.7 generation, succeeding Qwen3.6-Max-Preview. Alibaba positions Qwen 3.7 Max as an agent foundation with improvements in long-horizon tool use, multi-file coding, and office workflow automation.

  • What is the context window for Qwen 3.7 Max?

    The context window is 991K tokens. This supports full-repository ingestion, long agent traces with hundreds of sequential tool calls, and multi-document analysis without segmentation.

  • Can Qwen 3.7 Max accept image or file inputs?

    Qwen 3.7 Max is text-only. For vision input within the 3.7 line, use Qwen3.7-Plus, which is the multimodal entry in the generation.

  • Does Qwen 3.7 Max support tool calling?

    Yes. Qwen 3.7 Max supports structured tool calling and is tuned for agent workflows that chain many sequential tool invocations across long-horizon sessions.

  • How does extended-thinking mode work on Qwen 3.7 Max?

    Extended-thinking mode generates an internal reasoning trace before producing the final answer, which improves accuracy on high-difficulty logical reasoning, scientific computation, and expert-level queries. The thinking budget is tunable per request to balance depth against latency and token spend.

  • How do I access Qwen 3.7 Max through AI Gateway?

    Authenticate with an AI Gateway API key or OIDC token and reference `alibaba/qwen3.7-max` as the model. You can call Qwen 3.7 Max through AI SDK, Chat Completions API, Responses API, Messages API, or other API formats, from TypeScript or Python.

  • Does Qwen 3.7 Max support zero data retention?

    Zero Data Retention is not currently available for this model. Zero Data Retention is offered on a per-provider basis. See https://vercel.com/docs/ai-gateway/capabilities/zdr for details.

  • Where can I see live latency and cost data for Qwen 3.7 Max?

    This page shows live throughput, time-to-first-token, and pricing metrics for Qwen 3.7 Max measured across real AI Gateway traffic.