GPT 5.4 Nano

GPT 5.4 Nano is the smallest and most affordable model in the GPT-5.4 family. It performs close to GPT-5.4 Mini in evaluations at a lower price point and is built for high-volume sub-agent workflows.

Reasoning · Tool Use · Implicit Caching · Web Search · Vision (Image) · File Input
index.ts
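import { streamText } from 'ai'
const result = streamText({
  model: 'openai/gpt-5.4-nano', // model ID in provider/model form
  prompt: 'Why is the sky blue?'
})
// Consume the stream; without this loop nothing is printed.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}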

Frequently Asked Questions

  • How does GPT 5.4 Nano compare to GPT-5.4 Mini?

    It performs close to GPT-5.4 Mini in evaluations at a lower price point. Choose it when you run many calls in parallel and total cost grows with the number of calls.

  • What context window does GPT 5.4 Nano support?

    400K tokens, which is substantial for a model at this price tier.

  • Does GPT 5.4 Nano support the verbosity parameter?

    Yes. It supports verbosity and reasoning level parameters, giving you control over response detail and reasoning depth per request.

  • What tasks is GPT 5.4 Nano designed for?

    High-volume sub-agent workflows, classification, routing, lightweight code checks, and batch processing where per-call cost is the dominant concern.

  • Can GPT 5.4 Nano handle complex reasoning?

    For complex multi-step reasoning, GPT-5.4 Mini or the full GPT-5.4 will produce better results. GPT 5.4 Nano is optimized for simpler tasks at high volume.

  • How does AI Gateway handle authentication for GPT 5.4 Nano?

    AI Gateway accepts a single API key or OIDC token for all requests. You don't embed OpenAI credentials in your application; AI Gateway routes and authenticates on your behalf.

  • What are typical latency characteristics?

    This page shows live throughput and time-to-first-token metrics measured across real AI Gateway traffic.
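
The high-volume classification and routing workloads described above usually fan out many small calls at once. The following is a minimal sketch of one way to bound that fan-out, not a prescribed pattern. The label set, the `Classify` type, `mapWithConcurrency`, `classifyAll`, and `stubClassify` are all hypothetical names introduced for illustration. The stub stands in for a real model call so that the example runs offline; in practice it would presumably be replaced by a function that sends each text to 'openai/gpt-5.4-nano'.

```typescript
// Hypothetical label set and classifier signature, for illustration only.
type Label = 'billing' | 'bug' | 'other'
type Classify = (text: string) => Promise<Label>

// Run `worker` over `items` with at most `limit` calls in flight,
// returning results in input order.
async function mapWithConcurrency<T, R>(
  items: readonly T[],
  limit: number,
  worker: (item: T, index: number) => Promise<R>
): Promise<R[]> {
  if (!Number.isInteger(limit) || limit < 1) {
    throw new RangeError('limit must be a positive integer')
  }
  const results = new Array<R>(items.length)
  let next = 0
  // Each runner claims the next unprocessed index until none remain.
  async function runner(): Promise<void> {
    while (next < items.length) {
      const i = next++
      results[i] = await worker(items[i], i)
    }
  }
  const runners = Array.from(
    { length: Math.min(limit, items.length) },
    () => runner()
  )
  await Promise.all(runners)
  return results
}

// Stand-in classifier so the sketch runs without network access.
// A real sub-agent would call the model here instead (assumption).
const stubClassify: Classify = async (text) => {
  const t = text.toLowerCase()
  if (t.includes('invoice') || t.includes('refund')) return 'billing'
  if (t.includes('crash') || t.includes('error')) return 'bug'
  return 'other'
}

// Classify every text while keeping at most `limit` requests in flight.
async function classifyAll(
  texts: readonly string[],
  classify: Classify,
  limit = 8
): Promise<Label[]> {
  return mapWithConcurrency(texts, limit, (text) => classify(text))
}
```

Calling `classifyAll(texts, stubClassify, 2)` keeps at most two classifications in flight at a time. Passing the classifier in as an argument keeps the concurrency logic independent of any particular SDK call, so the stub can be swapped for a model-backed function without touching the helper.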