Skip to content
Dashboard

o3 Pro

o3 Pro applies maximum reasoning compute to the o3 architecture, delivering the deepest chain-of-thought analysis available for the most demanding mathematical, scientific, and analytical problems.

ReasoningVision (Image)File InputTool UseWeb Search
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'openai/o3-pro',
prompt: 'Why is the sky blue?'
})

Playground

Try out o3 Pro by OpenAI. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

openai logo
openai logo

Ask o3 Pro anything to try it out.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
OpenAI
200K
4.2s
$20.00/M$80.00/M
$10/K
+ input costs
—
+3
06/10/2025
Throughput

P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.

Latency

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.

Uptime

Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.

More models by OpenAI

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
1M
2.5s
57tps
$5.00/M
$30.00/M
Read:
$0.5/M
Write:
—
$10.00/K
+ input costs
—
+4
azure logo
bedrock logo
openai logo
04/24/2026
400K
1.0s
177tps
$0.75/M$4.50/M
Read:$0.07/M
Write:—
$10.00/K
+ input costs
—
+4
azure logo
openai logo
03/17/2026
1.1M
1.0s
78tps
$2.50/M
$15.00/M
Read:
$0.25/M
Write:
—
$10.00/K
+ input costs
—
+4
azure logo
openai logo
03/05/2026
131K
0.2s
315tps
$0.35/M$0.75/M
Read:$0.25/M
Write:—
——
baseten logo
bedrock logo
cerebras logo
+5
08/05/2025
1M
0.3s
$0.10/M$0.40/M
Read:$0.03/M
Write:—
$14/K
+ input costs
—
+3
azure logo
openai logo
04/14/2025
128K
0.5s
70tps
$0.15/M$0.60/M
Read:$0.07/M
Write:—
$14/K
+ input costs
—
+3
azure logo
openai logo
07/18/2024

About o3 Pro

o3 Pro was released on June 10, 2025 as the premium tier of the o3 reasoning model family. It applies maximum compute to each reasoning request, enabling the deepest chain-of-thought analysis available from OpenAI.

The additional compute produces more thorough deliberation: the model explores more solution paths, checks its work more exhaustively, and arrives at conclusions with higher confidence. On the hardest mathematical, scientific, and analytical problems, this translates to measurably better accuracy.

The tradeoff is cost and latency. o3 Pro is the most expensive model in the o3 family and takes the longest to respond. For problems where the difference between good and best matters, that investment is justified. For standard reasoning tasks, o3 or o3-mini provide sufficient quality at substantially lower cost.

What To Consider When Choosing a Provider

  • Configuration: o3 Pro applies the most compute per request of any o3 variant. Use it when the problem genuinely requires exhaustive deliberation and the quality difference is consequential.
  • Configuration: Most teams reserve o3 Pro for their hardest queries, routing standard reasoning traffic to o3 or o3-mini.
  • Zero Data Retention: AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use o3 Pro

Best For

  • Frontier mathematical reasoning: The hardest proofs, competition problems, and mathematical research
  • Critical scientific analysis: Complex derivations and multi-step scientific reasoning where errors have consequences
  • Exhaustive code analysis: Security audits and algorithmic verification requiring thorough deliberation
  • Research-grade reasoning: Academic and research problems that benefit from the deepest available analysis
  • High-stakes analytical decisions: Analyses where maximum reasoning depth directly impacts outcomes

Consider Alternatives When

  • Standard reasoning: O3 for most reasoning tasks at lower cost
  • Budget-friendly reasoning: O3-mini for cost-efficient chain-of-thought on routine problems
  • General-purpose tasks: GPT-5.2 pro for tasks that need premium quality but not specialized reasoning
  • Fast responses: Any standard model when response time matters more than reasoning depth

Conclusion

o3 Pro applies maximum compute within the o3 family to the hardest problems. For mathematical, scientific, and analytical challenges where reasoning depth directly determines quality, it is the premium tier of the o3 lineup.