Skip to content

GPT-4.1 mini

GPT-4.1 mini delivers GPT-4o-class intelligence at reduced cost with nearly half the latency, making it a cost-performance option in the GPT-4.1 family for high-volume production workloads.

File InputTool UseVision (Image)Implicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'openai/gpt-4.1-mini',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
Azure
1M
1.3s
104tps
$0.40/M$1.60/M
Read:$0.1/M
Write:
$14/K
+ input costs
+2
05/14/2025
OpenAI
1M
0.7s
72tps
$0.40/M$1.60/M
Read:$0.1/M
Write:
$10.00/K
+ input costs
+3
05/14/2025