Whisper
Whisper is a general-purpose speech recognition model, trained on a large dataset of diverse audio. You can also use it as a multitask model to perform multilingual speech recognition as well as speech translation and language identification.
translationTranscription
index.ts
import { experimental_transcribe as transcribe } from 'ai';import { gateway } from '@ai-sdk/gateway';import { readFile } from 'node:fs/promises';
const result = await transcribe({ model: gateway.transcriptionModel('openai/whisper-1'), audio: await readFile('audio.mp3'),});Playground
Try out Whisper by OpenAI. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.
Speech to text
Record a short clip from your microphone and the model transcribes it to text.
Record a clip to see the transcript here.
Providers
Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
| Provider |
|---|
More models by OpenAI
| Model |
|---|