Skip to content
Dashboard

GPT-4o Transcribe

GPT-4o Transcribe is a speech-to-text model that uses GPT-4o to transcribe audio. It offers improvements to word error rate and better language recognition and accuracy compared to original Whisper models. Use it for more accurate transcripts.

index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'openai/gpt-4o-transcribe',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
OpenAI
$2.50/M+1 more
$10.00/M———03/13/2024

More models by OpenAI

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
1M
0.8s
33tps
$5.00/M
$30.00/M
Read:
$0.5/M
Write:
—
$10.00/K
+ input costs
—
+4
azure logo
bedrock logo
openai logo
04/24/2026
400K
1.3s
238tps
$0.75/M$4.50/M
Read:$0.07/M
Write:—
$10.00/K
+ input costs
—
+4
azure logo
openai logo
03/17/2026
400K
0.4s
164tps
$0.20/M$1.25/M
Read:$0.02/M
Write:—
$10.00/K
+ input costs
—
+4
azure logo
openai logo
03/17/2026
1.1M
1.7s
85tps
$2.50/M
$15.00/M
Read:
$0.25/M
Write:
—
$10.00/K
+ input costs
—
+4
azure logo
openai logo
03/05/2026
400K
3.4s
97tps
$0.25/M$2.00/M
Read:$0.03/M
Write:—
$14/K
+ input costs
—
+3
azure logo
openai logo
08/07/2025
131K
0.1s
1596tps
$0.35/M$0.75/M
Read:$0.25/M
Write:—
——
baseten logo
bedrock logo
cerebras logo
+5
08/05/2025