GPT-5 mini
GPT-5 mini delivers GPT-5 family intelligence at a reduced cost tier, making advanced reasoning, coding, and multimodal capabilities accessible for high-volume production workloads where full GPT-5 pricing is impractical.
import { streamText } from 'ai'
const result = streamText({ model: 'openai/gpt-5-mini', prompt: 'Why is the sky blue?'})Frequently Asked Questions
How does GPT-5 mini compare to GPT-4o mini?
GPT-5 mini is the next generation of OpenAI's mid-tier model, delivering improved reasoning, coding, and instruction following compared to GPT-4o mini.
What context window does GPT-5 mini support?
400K tokens, enabling extensive document processing and conversation history retention.
When should I use full GPT-5 instead of mini?
When the task demands maximum capability, particularly on complex reasoning, nuanced writing, or challenging coding problems where the quality gap is measurable and consequential.
Does GPT-5 mini support function calling and structured outputs?
Yes. It supports the full API feature set including function calling, structured outputs via JSON schema, vision input, and system messages.
How does AI Gateway handle authentication for GPT-5 mini?
AI Gateway accepts a single API key or OIDC token for all requests. You don't embed OpenAI credentials in your application; AI Gateway routes and authenticates on your behalf.
What is the pricing for GPT-5 mini?
Pricing appears on this page and updates as providers adjust their rates. AI Gateway routes traffic through the configured provider.
What are typical latency characteristics?
This page shows live throughput and time-to-first-token metrics measured across real AI Gateway traffic.