Whisper

Whisper is OpenAI's general-purpose speech recognition model. It was trained on 680,000 hours of multilingual audio and works as a single multitask model that can transcribe speech, translate it to English, and identify the spoken language. Your use is subject to OpenAI's Terms & Privacy Policies.

Translation, Transcription

Use with AI Gateway

import { experimental_transcribe as transcribe } from 'ai';
import { gateway } from '@ai-sdk/gateway';
import { readFile } from 'node:fs/promises';

// Read the audio file into memory and send it to Whisper through AI Gateway.
const result = await transcribe({
  model: gateway.transcriptionModel('openai/whisper-1'),
  audio: await readFile('audio.mp3'),
});

// Print the transcribed text.
console.log(result.text);
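
The `result` returned by `transcribe` above can be post-processed before display. The following is a minimal sketch, assuming the result carries a `text` string and a `segments` array whose entries have `text`, `startSecond`, and `endSecond` fields. The `TranscriptLike` and `SegmentLike` interfaces and the `formatTranscript` helper are hypothetical names defined locally for illustration, not SDK types, and the sample data is invented.

```typescript
// Hypothetical local shapes; assumed to resemble the object returned by transcribe().
interface SegmentLike {
  text: string;
  startSecond: number;
  endSecond: number;
}

interface TranscriptLike {
  text: string;
  segments: SegmentLike[];
}

// Format a duration in seconds as mm:ss (fractions of a second are dropped).
function toTimestamp(seconds: number): string {
  const total = Math.max(0, Math.floor(seconds));
  const mm = String(Math.floor(total / 60)).padStart(2, '0');
  const ss = String(total % 60).padStart(2, '0');
  return `${mm}:${ss}`;
}

// One line per segment; fall back to the full text when no segments exist.
function formatTranscript(result: TranscriptLike): string[] {
  if (result.segments.length === 0) {
    return [result.text.trim()];
  }
  return result.segments.map(
    (s) => `[${toTimestamp(s.startSecond)} - ${toTimestamp(s.endSecond)}] ${s.text.trim()}`,
  );
}

// Sample data, invented for illustration only.
const sample: TranscriptLike = {
  text: 'Hello there. How are you?',
  segments: [
    { text: ' Hello there.', startSecond: 0, endSecond: 1.8 },
    { text: ' How are you?', startSecond: 1.8, endSecond: 75.2 },
  ],
};

const lines = formatTranscript(sample);
console.log(lines.join('\n'));
```

If the real result has no segments, the helper simply returns the full text as a single line, so it can be used whether or not timing information is available.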


More models by OpenAI

Model	Context	Latency	Throughput	Input	Output	Cache read	Cache write	Web search	Release date
openai/gpt-5.6-luna	1.1M	1.7s	167tps	$1/M, $0.20/M (+1 more)	$6/M, $1.20/M (+1 more)	$0.1/M, $0.02/M (+1 more)	$1.25/M, $0.25/M (+1 more)	$10/K + input costs	07/09/2026
openai/gpt-5.6-sol	1.1M	2.3s	82tps	$5/M (+1 more)	$30/M (+1 more)	$0.5/M (+1 more)	$6.25/M (+1 more)	$10/K + input costs	07/09/2026
openai/gpt-5.4-mini	400K	0.7s	173tps	$0.75/M	$4.50/M	$0.07/M	—	$10/K + input costs	03/17/2026
openai/gpt-5-nano	400K	4.5s	190tps	$0.05/M	$0.40/M	$0.01/M	—	$14/K + input costs	08/07/2025
openai/gpt-5-mini	400K	2.8s	82tps	$0.25/M	$2/M	$0.03/M	—	$14/K + input costs	08/07/2025
openai/gpt-oss-120b	131K	0.2s	480tps	$0.35/M	$0.75/M	$0.25/M	—	—	08/05/2025
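
The "/M" prices in the table are quoted per one million tokens, so a rough cost for a request is each token count multiplied by its price and divided by one million. The sketch below uses the listed openai/gpt-5.4-mini prices ($0.75/M input, $4.50/M output). The `TokenPricing` interface, the `estimateCostUsd` helper, and the token counts are hypothetical, and the estimate deliberately ignores cache pricing, web search fees, and any additional price tiers.

```typescript
// Prices in US dollars per one million tokens.
interface TokenPricing {
  inputPerMillion: number;
  outputPerMillion: number;
}

// Listed prices for openai/gpt-5.4-mini: $0.75/M input, $4.50/M output.
const gpt54Mini: TokenPricing = { inputPerMillion: 0.75, outputPerMillion: 4.5 };

// Hypothetical helper: cost = (inputTokens * inputPrice + outputTokens * outputPrice) / 1e6.
// Cache reads/writes and web search charges are not included.
function estimateCostUsd(
  pricing: TokenPricing,
  inputTokens: number,
  outputTokens: number,
): number {
  const weighted =
    inputTokens * pricing.inputPerMillion + outputTokens * pricing.outputPerMillion;
  return weighted / 1e6;
}

// Invented workload: 200,000 input tokens and 50,000 output tokens.
const cost = estimateCostUsd(gpt54Mini, 200_000, 50_000);
console.log(`$${cost.toFixed(3)}`); // prints "$0.375"
```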
