Skip to content

Qwen3 Coder 480B A35B Instruct

Qwen3 Coder 480B A35B Instruct is an agentic-focused code model from Alibaba, built for complex multi-step software engineering workflows and accessible through AI Gateway without requiring separate provider accounts.

Tool Use
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'alibaba/qwen3-coder',
prompt: 'Why is the sky blue?'
})

Playground

Try out Qwen3 Coder 480B A35B Instruct by Alibaba. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

About Qwen3 Coder 480B A35B Instruct

Qwen3 Coder 480B A35B Instruct targets agentic coding: multi-step software engineering that involves reading files, running tools, debugging failures, and iterating across a real codebase. It's available on AI Gateway for teams building complex, multi-step coding workflows.

The underlying architecture is a 480-billion-parameter MoE model with 35 billion activated per token. This balances deep coding specialization against general reasoning and mathematics, with strong reported results among open models on SWE-Bench Verified.

Qwen3 Coder 480B A35B Instruct integrates with Qwen Code, Claude Code, and Cline. These developer toolchains require agentic coding models to handle tool calls, environment feedback, and iterative plan-execute-debug loops. The model natively supports context of 262.1K tokens, extendable to 1M tokens via YaRN extrapolation, giving it the working memory to hold substantial codebases or long conversation histories during extended development sessions.

Accessing Qwen3 Coder 480B A35B Instruct through AI Gateway removes the friction of managing separate provider credentials. Automatic retries and failover mean that infrastructure issues at any individual provider don't interrupt ongoing coding sessions or break CI pipelines.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
DeepInfra
Legal:Terms
Privacy
262K
0.3s
52tps
$0.40/M$1.60/M
Read:$0.02/M
Write:
04/01/2025
Novita AI
Legal:Terms
Privacy
262K
1.2s
87tps
$0.30/M$2.50/M
04/01/2025
Google Vertex
Legal:Terms
Privacy
262K
0.7s
$0.22/M$1.80/M
Read:$0.02/M
Write:
04/01/2025
Alibaba
Legal:Terms
Privacy
262K
1.1s
97tps
$1.50/M
$7.50/M
Read:
$0.3/M
Write:
04/01/2025
Throughput

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.

Latency

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.

Uptime

Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.

More models by Alibaba

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
240K
2.9s
50tps
$1.30/M
$7.80/M
Read:
$0.26/M
Write:
$1.63/M
alibaba logo
04/20/2026
1M
0.8s
68tps
$0.50/M
$3.00/M
Read:
$0.1/M
Write:
$0.63/M
alibaba logo
fireworks logo
04/02/2026
1M
0.7s
179tps
$0.10/M$0.40/M
Read:$0.0/M
Write:$0.13/M
alibaba logo
02/24/2026
1M
2.5s
56tps
$0.40/M
$2.40/M
Read:
$0.04/M
Write:
$0.5/M
alibaba logo
02/16/2026
256K
0.2s
67tps
$0.50/M$1.20/M
bedrock logo
togetherai logo
07/22/2025
33K
$0.02/M
deepinfra logo
06/05/2025

What To Consider When Choosing a Provider

  • Configuration: For coding-focused workloads, provider latency can affect the feel of interactive development tools. Consider testing provider-specific response times against your throughput requirements.
  • Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use Qwen3 Coder 480B A35B Instruct

Best For

  • Agentic software engineering tasks: Workflows where the model needs to read code, plan changes, execute tools, interpret results, and iterate, rather than one-shot generation, are exactly what Qwen3 Coder 480B A35B Instruct is tuned for
  • SWE-bench-style repository-level tasks: Strong SWE-Bench Verified results among open models indicate solid performance on the kinds of multi-file, multi-step bug fixing and feature implementation tasks representative of real software maintenance
  • Building AI-powered developer tools: Teams constructing coding assistants, automated PR review, or code migration tools can use Qwen3 Coder 480B A35B Instruct as the reasoning backbone, with the long context window holding full file context
  • Tool-calling pipelines in development environments: Qwen3 Coder 480B A35B Instruct's orientation toward agentic tool use makes it reliable in frameworks that expose file system, shell, and test runner tools to the model during a coding session

Consider Alternatives When

  • The task is simple code completion or short snippet generation: Smaller, faster models can handle routine code completion without the overhead of an agentic model. Cost and latency are lower for routine generation
  • Non-coding tasks dominate the workload: Qwen3 Coder 480B A35B Instruct is specialized for code and agentic scenarios. For general text generation, summarization, or multilingual tasks not involving code, general-purpose Qwen3 variants will serve better
  • You need image or multimodal input handling: Qwen3 Coder 480B A35B Instruct is a text model; it doesn't process screenshots, diagrams, or visual inputs

Conclusion

Qwen3 Coder 480B A35B Instruct fills a specific niche: agentic software engineering at scale, where the model participates in extended coding sessions rather than answering isolated questions. AI Gateway exposes it for exactly this use case, and the provider network's automatic retries make it more reliable in CI and interactive tooling contexts than a direct single-provider integration would be.

Frequently Asked Questions

  • Why use Qwen3 Coder 480B A35B Instruct through AI Gateway?

    You can access Qwen3 Coder 480B A35B Instruct through AI Gateway without separate provider accounts. Built-in retries and failover keep coding pipelines running without interruption.

  • What makes Qwen3 Coder 480B A35B Instruct "agentic" compared to other coding models?

    Agentic coding means the model can operate in multi-turn, tool-using environments where it reads context, decides on actions, executes tools, interprets the results, and adapts its plan accordingly. Qwen3 Coder 480B A35B Instruct is tuned for this loop rather than one-shot code completion, which shows up in its tool-calling reliability and multi-step task performance.

  • What context length does Qwen3 Coder 480B A35B Instruct support?

    262.1K tokens natively, extendable to one million tokens using YaRN extrapolation. This context window lets agentic tasks hold entire repositories or extended tool-call histories in context.

  • Which developer tools is Qwen3 Coder 480B A35B Instruct integrated with?

    The model works with Qwen Code, Claude Code, and Cline out of the box. On AI Gateway, you can use the qwen3-coder model identifier from AI SDK, Chat Completions API, Responses API, Messages API, or other API formats.

  • How does AI Gateway improve reliability for coding pipeline use cases?

    AI Gateway adds automatic retries and failover across deepinfra, novita, vertex, alibaba. If a provider returns an error during a long agentic session, AI Gateway handles the retry transparently. This prevents the kind of mid-session failures that break automated coding workflows.

  • What benchmarks has Qwen3 Coder 480B A35B Instruct been evaluated on?

    The model reports strong results among open models on Agentic Coding, Agentic Browser-Use, and Agentic Tool-Use benchmarks, with SWE-Bench Verified performance cited as a specific highlight. See https://deepinfra.com/Qwen/Qwen3-Coder-480B-A35B-Instruct for detailed results.

  • Can I use Qwen3 Coder 480B A35B Instruct for non-coding tasks?

    Qwen3 Coder 480B A35B Instruct retains general reasoning and math capability alongside its coding specialization and can handle general tasks, but dedicated general-purpose Qwen3 models are better suited when code isn't the primary use case.