Skip to content

o3-mini

o3-mini is a cost-efficient reasoning model in the o3 family, delivering strong chain-of-thought performance on math, code, and science at a fraction of full o3's cost, with configurable reasoning effort for flexible cost-quality tradeoffs.

File InputReasoningTool UseImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'openai/o3-mini',
prompt: 'Why is the sky blue?'
})

About o3-mini

o3-mini was released on December 20, 2024 as the cost-efficient tier of the o3 reasoning model family. It continues the pattern established by o1-mini: delivering strong chain-of-thought reasoning on structured domains (mathematics, coding, science) at a fraction of the full model's cost.

The model supports the reasoning_effort parameter, letting you control reasoning depth per request. Low effort for straightforward technical queries conserves tokens and reduces cost; high effort for competition-level problems applies the full reasoning capability. This flexibility lets you use o3-mini as the default for all technical queries rather than maintaining a routing layer.

With a context window of 200K tokens and support for the standard API features, o3-mini handles the same types of requests as full o3. The tradeoff is concentrated in reasoning depth: on the hardest problems, full o3 will produce more thorough analysis.