Skip to content
Dashboard

GPT 5 Chat

GPT 5 Chat is a conversational variant of GPT-5 optimized for multi-turn dialogue, delivering the GPT-5 family's advanced capabilities in an interface tuned specifically for chat-based applications.

File InputImplicit CachingTool UseVision (Image)Web Search
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'openai/gpt-5-chat',
prompt: 'Why is the sky blue?'
})

Playground

Try out GPT 5 Chat by OpenAI. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

openai logo
openai logo

Ask GPT 5 Chat anything to try it out.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
Azure
128K
0.5s
$1.25/M$10.00/M
Read:$0.13/M
Write:—
$14/K
+ input costs
—
+3
08/07/2025
OpenAI
128K
0.4s
61tps
$1.25/M$10.00/M
Read:$0.13/M
Write:—
$10.00/K
+ input costs
—
+3
08/07/2025
Throughput

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.

Latency

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.

Uptime

Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.

More models by OpenAI

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
1M
2.0s
55tps
$5.00/M
$30.00/M
Read:
$0.5/M
Write:
—
$10.00/K
+ input costs
—
+4
azure logo
bedrock logo
openai logo
04/24/2026
400K
0.7s
170tps
$0.75/M$4.50/M
Read:$0.07/M
Write:—
$10.00/K
+ input costs
—
+4
azure logo
openai logo
03/17/2026
1.1M
0.9s
83tps
$2.50/M
$15.00/M
Read:
$0.25/M
Write:
—
$10.00/K
+ input costs
—
+4
azure logo
openai logo
03/05/2026
400K
4.3s
143tps
$0.25/M$2.00/M
Read:$0.03/M
Write:—
$14/K
+ input costs
—
+4
azure logo
openai logo
08/07/2025
131K
0.2s
316tps
$0.35/M$0.75/M
Read:$0.25/M
Write:—
——
baseten logo
bedrock logo
cerebras logo
+5
08/05/2025
128K
0.6s
81tps
$0.15/M$0.60/M
Read:$0.07/M
Write:—
$14/K
+ input costs
—
+3
azure logo
openai logo
07/18/2024

About GPT 5 Chat

GPT 5 Chat became available on August 7, 2025 as a conversational variant within the GPT-5 model family on AI Gateway. While the base GPT-5 model serves as a general-purpose model, GPT 5 Chat is specifically optimized for the patterns that matter in dialogue: maintaining coherence across many turns, producing natural conversational flow, and adapting tone and detail level based on the conversation context.

The model retains the GPT-5 family's core capabilities, including the context window of 128K tokens, strong instruction following, and multimodal input support. The optimization focuses on how these capabilities are applied in conversational settings, producing responses that feel more natural in back-and-forth dialogue.

For developers building chat-based products, GPT 5 Chat reduces the prompt engineering effort needed to achieve natural conversation quality. The model's training emphasizes the conversational use case directly rather than requiring developers to shape general-purpose model output into dialogue form.

What To Consider When Choosing a Provider

  • Configuration: GPT 5 Chat is specifically tuned for conversational patterns. If your application is primarily a chat interface, this variant may produce more natural dialogue flow than the base GPT-5 model.
  • Configuration: The base GPT-5 is a general-purpose model; this variant prioritizes conversational coherence and turn-taking quality.
  • Zero Data Retention: AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use GPT 5 Chat

Best For

  • Customer-facing chatbots: Natural, coherent multi-turn conversations with strong reasoning
  • Interactive assistants: Conversational interfaces that require context retention across many turns
  • Tutoring and education: Dialogue-based learning experiences with adaptive conversation flow
  • Support automation: Multi-step troubleshooting conversations that maintain context
  • Conversational search: Natural language interaction for exploring complex topics

Consider Alternatives When

  • Non-conversational workloads: Base GPT-5 is better for single-shot tasks, batch processing, or structured generation
  • Cost-sensitive chat: GPT-5 mini provides strong conversational capability at lower cost
  • Simple Q&A: GPT-4o mini handles straightforward chat at a fraction of the price
  • Deep reasoning in chat: O3 provides stronger chain-of-thought for technically demanding conversations

Conclusion

GPT 5 Chat brings GPT-5's capabilities to a conversational interface optimized for multi-turn dialogue. For chat-based applications on AI Gateway where conversation quality and natural flow are priorities, it is purpose-built for the task.