Model Fallbacks

Last updated January 30, 2026

You can configure model failover to specify backups that are tried in order if the primary model fails or is unavailable.

Using the `models` option

Use the models array in providerOptions.gateway to specify fallback models:

app/api/chat/route.ts

import { streamText } from 'ai';
 
export async function POST(request: Request) {
  const { prompt } = await request.json();
 
  const result = streamText({
    model: 'openai/gpt-5.4', // Primary model
    prompt,
    providerOptions: {
      gateway: {
        models: ['anthropic/claude-sonnet-4.6', 'google/gemini-3-flash'], // Fallback models
      },
    },
  });
 
  return result.toUIMessageStreamResponse();
}

In this example:

The gateway first attempts the primary model (openai/gpt-5.4)
If that fails, it tries anthropic/claude-sonnet-4.6
If that also fails, it tries google/gemini-3-flash
The response comes from the first model that succeeds

Combining with provider routing

You can use models together with order to control both model failover and provider preference:

app/api/chat/route.ts

import { streamText } from 'ai';
 
export async function POST(request: Request) {
  const { prompt } = await request.json();
 
  const result = streamText({
    model: 'openai/gpt-5.4',
    prompt,
    providerOptions: {
      gateway: {
        models: ['openai/gpt-5-nano', 'anthropic/claude-sonnet-4.6'],
        order: ['azure', 'openai'], // Provider preference for each model
      },
    },
  });
 
  return result.toUIMessageStreamResponse();
}

This configuration:

Tries openai/gpt-5.4 via Azure, then OpenAI
If both fail, tries openai/gpt-5-nano via Azure first, then OpenAI
If those fail, tries anthropic/claude-sonnet-4.6 via available providers

How failover works

When processing a request with model fallbacks:

The gateway routes the request to the primary model (the model parameter)
For each model, provider routing rules apply (using order or only if specified)
If all providers for a model fail, the gateway tries the next model in the models array
The response comes from the first successful model/provider combination

Failover happens automatically. To see which model and provider served your request, check the provider metadata.

Provider Options

Provider Timeouts

Was this helpful?

AI Cloud

Core Platform

Security

Company

Learn

Open Source

Use Cases

Tools

Users

Model Fallbacks

Using the `models` option

Combining with provider routing

How failover works

AI Cloud

Core Platform

Security

Company

Learn

Open Source

Use Cases

Tools

Users

Model Fallbacks

Using the models option

Combining with provider routing

How failover works

Using the `models` option