GPT-4.1 nano

GPT-4.1 nano is the smallest and fastest model in the GPT-4.1 family. It is designed for high-volume, low-latency tasks such as classification, autocomplete, and routing, and it delivers strong results on MMLU at the lowest price point in the GPT-4.1 lineup.

File Input, Implicit Caching, Tool Use, Vision (Image), Web Search

index.ts

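import { streamText } from 'ai'

// Start a streaming completion against GPT-4.1 nano.
const result = streamText({
  model: 'openai/gpt-4.1-nano',
  prompt: 'Why is the sky blue?'
})

// Print each text chunk as it arrives.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}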
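Since the description names classification and routing as target workloads, the plumbing around such a call could be sketched roughly as follows. This is only an illustration: the route labels and the helpers `buildRoutingPrompt` and `parseRoute` are hypothetical and not part of the AI SDK, and the model call itself is left out.

```typescript
// Hypothetical route labels, chosen only for illustration.
const ROUTES = ['billing', 'technical', 'general'] as const
type Route = (typeof ROUTES)[number]

// Build a prompt that asks the model to answer with exactly one label.
function buildRoutingPrompt(message: string): string {
  return [
    `Classify the message into one of: ${ROUTES.join(', ')}.`,
    'Reply with the label only.',
    `Message: ${message}`
  ].join('\n')
}

// Map the model's raw reply onto a known route, falling back to 'general'
// when the reply does not match any label.
function parseRoute(reply: string): Route {
  const cleaned = reply.trim().toLowerCase()
  return ROUTES.find((r) => r === cleaned) ?? 'general'
}
```

The string returned by `buildRoutingPrompt` would be passed as the `prompt` in a call like the one in index.ts, and the model's text reply would then go through `parseRoute`.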


More models by OpenAI

Model	Context	Latency	Throughput	Input	Output	Cache	Web Search	Per Query	Release Date
openai/gpt-5.5		2.4s	51tps	$5.00/M	$30.00/M	Read: $0.5/M, Write: —	$10.00/K + input costs	—	04/24/2026
openai/gpt-5.4-mini	400K	1.0s	166tps	$0.75/M	$4.50/M	Read: $0.07/M, Write: —	$10.00/K + input costs	—	03/17/2026
openai/gpt-5.4	1.1M	3.4s	89tps	$2.50/M	$15.00/M	Read: $0.25/M, Write: —	$10.00/K + input costs	—	03/05/2026
openai/gpt-5-mini	400K	3.5s	125tps	$0.25/M	$2.00/M	Read: $0.03/M, Write: —	$14/K + input costs	—	08/07/2025
openai/gpt-oss-120b	131K	0.2s	377tps	$0.35/M	$0.75/M	Read: $0.25/M, Write: —		—	08/05/2025
openai/gpt-4.1-mini		0.6s	113tps	$0.40/M	$1.60/M	Read: $0.1/M, Write: —	$14/K + input costs	—	05/14/2025
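The Input and Output prices above are quoted per million tokens, so the cost of a single request is simple arithmetic. The sketch below shows one way to estimate it, using the openai/gpt-4.1-mini row as sample data. The `Pricing` type and `estimateCost` function are hypothetical names introduced here, and the estimate ignores cache reads and web search fees.

```typescript
// Prices in USD per million tokens, copied from the openai/gpt-4.1-mini row.
interface Pricing {
  inputPerMillion: number
  outputPerMillion: number
}

const GPT_4_1_MINI: Pricing = { inputPerMillion: 0.4, outputPerMillion: 1.6 }

// Estimate the cost of one request in USD.
// Assumption: no cached input tokens and no web search queries.
function estimateCost(p: Pricing, inputTokens: number, outputTokens: number): number {
  return (inputTokens / 1_000_000) * p.inputPerMillion +
    (outputTokens / 1_000_000) * p.outputPerMillion
}

// 10,000 input tokens and 2,000 output tokens:
// 0.01 * 0.40 + 0.002 * 1.60 = 0.0072 USD (up to floating-point rounding)
const example = estimateCost(GPT_4_1_MINI, 10_000, 2_000)
```

Swapping in another row's Input and Output prices gives a rough side-by-side comparison for the same token counts.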
