GPT-5 nano

GPT-5 nano is the fastest and most affordable model in the GPT-5 family, designed for high-throughput, low-latency tasks like classification, routing, autocomplete, and lightweight inference at scale.

File InputImplicit CachingReasoningTool UseVision (Image)Web Search

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'openai/gpt-5-nano',
  prompt: 'Why is the sky blue?'
})

Overview About Providers Throughput Latency Uptime Status Similar FAQ

About GPT-5 nano

GPT-5 nano was released on August 7, 2025 as the entry-level tier of the GPT-5 model family. It's optimized for the highest throughput and lowest latency in the family, targeting workloads where speed and cost matter more than reasoning depth.

Despite being the smallest GPT-5 variant, GPT-5 nano benefits from the family's architectural improvements. It handles classification, routing, extraction, and simple generation tasks with quality that reflects the generational leap from GPT-4.1 nano. The context window of 400K tokens is notable for a model at this tier, enabling it to process long inputs even when outputs remain short.

The model is designed to serve as a building block in larger systems: classifying incoming requests, routing them to appropriate handlers, extracting key fields from documents, and providing instant responses for simple queries, all at a cost that makes per-request inference viable for the highest-traffic applications.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

GPT-5 nano

About GPT-5 nano