How is AI Gateway priced?

We offer tokens at list price from the upstream providers with no markup, including when you bring your own keys. Certain capabilities are available at higher plan tiers and metered separately. See the pricing page for details.

What's the difference between using AI Gateway and going direct to each provider?

AI Gateway gives you one integration, automatic failover, unified spend tracking, and one invoice across every major provider. Going direct means signing N contracts and stitching together N billing dashboards.

Will AI Gateway work with our existing AI stack?

Almost certainly. AI Gateway supports the AI SDK, OpenAI Chat Completions, OpenAI Responses, Anthropic Messages, and an OpenResponses-compatible endpoint. Migrating is typically a base URL swap with no code changes.

Which modalities does AI Gateway support?

Text, image, video, embeddings, and reranking, all through the same endpoint. Browse the full model catalog for specific models and providers.

How does AI Gateway handle our enterprise security and compliance requirements?

AI Gateway supports Zero Data Retention routing, a no-training guarantee, and team-wide provider allowlists. See the security overview for full details.

What observability does AI Gateway provide out of the box?

A dashboard with usage, spend, request volume, TTFT, and token counts, broken down by model, provider, and project. For deeper analysis, the Custom Reporting API lets you pull the same data into your own tools.

Can we use our existing provider contracts and committed spend?

Yes, through BYOK. Bring your own keys for almost every supported provider and your existing commitments flow through. We try BYOK first and only fall back to system credentials on failure.

Do I pay per request or get invoiced?

AI Gateway uses pre-purchased credits by default. Top up in the dashboard and usage is drawn down per request. Enterprise customers can switch to a single consolidated invoice from Vercel covering every provider in their routing pool. For invoicing, reach out to sales for more details.

The AI Gateway For Developers

Get started Read the docs

Hundreds of models, one enterprise grade pane of glass.

TypeScript

import { streamText } from 'ai'

const result = streamText({
  model: 'xai/grok-4.3',
  prompt: 'Why is the sky blue?'
})

andmore

Markup on TokensPay provider prices,
never a cent more.

Optimize routing for availability, cost, or latency

If a provider degrades, the Gateway fails over to the same model on another provider. Identical output, no downtime.

Everyday requests route to a cost-efficient open model. Complex jobs escalate to a frontier model only when needed.

Latency-sensitive requests route to the fastest-responding model for the lowest time to first token. Heavier requests fall through to a larger model.

App

Provider degraded?

Claude Opus 4.8Anthropic

Degraded

Claude Opus 4.8Amazon Bedrock

Latency figures are illustrative and vary by region, traffic, and prompt length.

Routing, billing, and observability in one place

Providing the developer experience and infrastructure to build, scale, and secure a faster, more personalized web.

One API key, hundreds of models. Unified billing and observability across your entire AI stack, with text, image, video, and audio models.
Route on behavior, fallback anytime Automatic fallbacks during provider outages so your app stays up even when a model goes down.
OpenAI$12.47
Anthropic$8.22
xAI$4.31
Platform fee$0.00
Total$25.00
No markup, just fair prices Pay exactly what providers charge with no platform fees.

Works with your existing AI stack

- X_AI_API_KEY- ANTHROPIC_API_KEY- OPEN_AI_API_KEY+ AI_GATEWAY_API_KEY

Move existing OpenAI, Anthropic, and AI SDK integrations to AI Gateway with a base URL swap.

AISDK

The open-source AI toolkit designed to help developers build AI-powered applications and agents with React, Next.js, Vue, Svelte, Node.js, and more.

OpenAI Chat Completions

OpenAI Responses

Anthropic Messages

Open Responses

“Moving to the gateway is just so ergonomic. We get references to model names, and rely on Vercel to do the correct implementations and handle the edge cases.”

Rob CheungCo-founder

20x
Improvement in AI
reliability after switching
30s
To adopt a new model,
down from an hour of code
25%
Improvement in average
latency on model calls

Production-ready,
from request one

Security, spend controls, and compatibility built for teams running real traffic.

Security and complianceRoute only to ZDR providers. No training on data, no prompt logging. Configure custom rules per request or team-wide.
Track everythingTrack tokens, requests, spend, and performance across your AI stack. Cap spend with per-key budgets and per-user quotas.
Drop-in compatibleUse the API formats you already have. Migrate existing apps to AI Gateway in minutes, no SDK rewrites required.

Take control

Manage usage across teams with budgets, quotas, and full request visibility.

Set controls

API key management Create, rotate, and revoke keys. Set per-key budgets that enforce spend limits from the dashboard or CLI.

Per-user spending limits. Cap spend for individual users on shared keys. No separate API key required.

Short-lived OIDC tokens. Authenticate with expiring credentials. No static credentials to manage.

See everything

Custom Reporting API. Tag requests by user, customer, feature, or environment. Analyze spend in the dashboard or your own systems.

Dashboard observability. Usage, spend, requests, TTFT, and token counts at team, API key, and project scope.

Works with the tools your team already uses

Route the most popular AI coding agents through AI Gateway with a base URL change. Get unified observability and spend tracking across every tool, no matter who built it.

See all supported coding agents

Get Started

This quickstart walks you through making your first text generation request with AI Gateway.

Create a new directory and initialize a Node.js project.

Terminal

mkdir ai-text-demo
cd ai-text-demo
pnpm init

Install the AI SDK and development dependencies.

Terminal

npm install ai dotenv @types/node tsx typescript

Go to the AI Gateway API Keys page in your Vercel dashboard and click Create Key to generate a new API Key. Create a .env.local file and save your API Key.

.env.local

AI_GATEWAY_API_KEY=your_ai_gateway_api_key

Create the index.ts file.

index.ts

import { streamText } from 'ai';
import 'dotenv/config';

async function main() {
  const result = streamText({
    model: 'openai/gpt-5.5',
    prompt: 'Invent a new holiday and describe its traditions.',
  });

  for await (const textPart of result.textStream) {
    process.stdout.write(textPart);
  }

  console.log();
  console.log('Token usage:', await result.usage);
  console.log('Finish reason:', await result.finishReason);
}

main().catch(console.error);

You should see the AI model’s response stream to your terminal.

Terminal

pnpm tsx index.ts

1Set Up Your Project

Create a new directory and initialize a Node.js project.

Terminal

mkdir ai-text-demo
cd ai-text-demo
pnpm init

2Install Dependencies

Install the AI SDK and development dependencies.

Terminal

npm install ai dotenv @types/node tsx typescript

3Set Up Your API Key

Go to the AI Gateway API Keys page in your Vercel dashboard and click Create Key to generate a new API Key. Create a .env.local file and save your API Key.

.env.local

AI_GATEWAY_API_KEY=your_ai_gateway_api_key

4Create and Run Your Script

Create the index.ts file.

index.ts

import { streamText } from 'ai';
import 'dotenv/config';

async function main() {
  const result = streamText({
    model: 'openai/gpt-5.5',
    prompt: 'Invent a new holiday and describe its traditions.',
  });

  for await (const textPart of result.textStream) {
    process.stdout.write(textPart);
  }

  console.log();
  console.log('Token usage:', await result.usage);
  console.log('Finish reason:', await result.finishReason);
}

main().catch(console.error);

5Run your script.

You should see the AI model’s response stream to your terminal.

Terminal

pnpm tsx index.ts

6Next steps

Agent Stack

Core Platform

Tools

Learn

Build

Explore

The AI Gateway For Developers

Optimize routing for availability, cost, or latency

Routing, billing, and observability in one place

Works with your existing AI stack

Production-ready,
from request one

Take control

Works with the tools your team already uses

Get Started

Frequently asked questions

The AI Gateway For Developers

Optimize routing for availability, cost, or latency

Routing, billing, and observability in one place

Works with your existing AI stack

Production-ready,from request one

Take control

Works with the tools your team already uses

Get Started

Frequently asked questions

[ 01 ]How is AI Gateway priced?

[ 02 ]What's the difference between using AI Gateway and going direct to each provider?

[ 03 ]Will AI Gateway work with our existing AI stack?

[ 04 ]Which modalities does AI Gateway support?

[ 05 ]How does AI Gateway handle our enterprise security and compliance requirements?

[ 06 ]What observability does AI Gateway provide out of the box?

[ 07 ]Can we use our existing provider contracts and committed spend?

[ 08 ]Do I pay per request or get invoiced?

Production-ready,
from request one