GPT-5.1 Instant

GPT-5.1 Instant is the fastest model in the GPT-5.1 family. It is optimized for low-latency responses on general-purpose tasks and delivers GPT-5.1-generation quality at speeds suited to real-time applications.

Tool Use, Vision (Image), File Input, Reasoning, Implicit Caching, Web Search
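index.ts
import { streamText } from 'ai';

const result = streamText({
  model: 'openai/gpt-5.1-instant',
  prompt: 'Why is the sky blue?',
});

// Print each text chunk as it arrives.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk);
}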

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
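As a rough illustration of setting a provider preference, the sketch below builds an options object that lists provider slugs in priority order. It assumes the gateway reads an `order` array under `providerOptions.gateway` (as the Vercel AI Gateway does), and that `azure` and `openai` are the slugs for the providers listed here; the helper name `preferProviders` is hypothetical. Copy the actual slugs from the table before using it.

```typescript
// Shape of the provider-preference options (assumed, see the note above).
type GatewayProviderOptions = { gateway: { order: string[] } };

// Hypothetical helper: returns options that try providers in the given order.
function preferProviders(...slugs: string[]): GatewayProviderOptions {
  if (slugs.length === 0) {
    throw new Error('at least one provider slug is required');
  }
  return { gateway: { order: [...slugs] } };
}

// Try Azure first, then fall back to OpenAI (slugs are assumptions).
const providerOptions = preferProviders('azure', 'openai');

// Intended use with the snippet at the top of the page:
// streamText({
//   model: 'openai/gpt-5.1-instant',
//   prompt: 'Why is the sky blue?',
//   providerOptions,
// });
```

Keeping the preference in a small helper makes it easy to change the order in one place if latency or pricing differs between providers.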

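| Provider | Context | Latency | Throughput | Input | Output | Cache | Web Search (per query) | Capabilities | ZDR | No Training | Release Date |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| OpenAI | 128K | 1.5 s | 108 tps | $1.25/M | $10.00/M | Read: $0.13/M, Write: not listed | $10.00/K + input costs | +4 | | | 11/12/2025 |
| Azure | 128K | 1.0 s | | $1.25/M | $10.00/M | Read: $0.13/M, Write: not listed | $14/K + input costs | +3 | | | 11/12/2025 |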
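The per-token and per-query prices listed above combine into a request cost with simple arithmetic. The sketch below is one way to estimate it. The names `Pricing`, `Usage`, and `estimateCostUsd` are illustrative, not part of any SDK. It assumes that cached input tokens are billed at the cache-read rate instead of the input rate, and that "+ input costs" means web search results are billed as ordinary input tokens (so they are already counted in `inputTokens`).

```typescript
// Prices in USD, copied from the provider listing above.
interface Pricing {
  inputPerMTok: number;         // per 1M input tokens
  outputPerMTok: number;        // per 1M output tokens
  cacheReadPerMTok: number;     // per 1M cached input tokens
  webSearchPerKQueries: number; // per 1K web search queries
}

const OPENAI_PRICING: Pricing = {
  inputPerMTok: 1.25,
  outputPerMTok: 10.0,
  cacheReadPerMTok: 0.13,
  webSearchPerKQueries: 10.0,
};

const AZURE_PRICING: Pricing = { ...OPENAI_PRICING, webSearchPerKQueries: 14.0 };

interface Usage {
  inputTokens: number;       // total input tokens, including cached ones
  cachedInputTokens: number; // portion of inputTokens served from cache
  outputTokens: number;
  webSearchQueries: number;
}

// Assumption: cached tokens replace (rather than add to) the normal input charge.
function estimateCostUsd(p: Pricing, u: Usage): number {
  const uncached = u.inputTokens - u.cachedInputTokens;
  const tokenCost =
    (uncached * p.inputPerMTok +
      u.cachedInputTokens * p.cacheReadPerMTok +
      u.outputTokens * p.outputPerMTok) / 1e6;
  const searchCost = (u.webSearchQueries * p.webSearchPerKQueries) / 1e3;
  return tokenCost + searchCost;
}

const example = estimateCostUsd(OPENAI_PRICING, {
  inputTokens: 1_000_000,
  cachedInputTokens: 500_000,
  outputTokens: 100_000,
  webSearchQueries: 0,
});
console.log(example.toFixed(2));
```

Under these assumptions, the example works out to $0.625 for uncached input, $0.065 for cached input, and $1.00 for output, or about $1.69 in total. Since token prices are identical for both providers, only the web search rate changes the result between OpenAI and Azure.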