Skip to content

GPT-5

GPT-5 is the standard tier of the GPT-5 model family, unifying advanced reasoning, coding, and multimodal capabilities in a single architecture that surpasses its predecessors across benchmarks while maintaining broad general-purpose utility.

File InputReasoningTool UseVision (Image) Image GenImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'openai/gpt-5',
prompt: 'Why is the sky blue?'
})

Frequently Asked Questions

  • How does GPT-5 compare to GPT-4o?

    GPT-5 represents a generational improvement over GPT-4o across reasoning, coding, instruction following, and multimodal capabilities.

  • What context window does GPT-5 support?

    GPT-5 supports a context window of 400K tokens, enabling full-codebase analysis and extended document processing in a single request.

  • When should I use GPT-5 versus GPT-5 mini?

    Use GPT-5 when you need the highest-tier capability in the GPT-5 family. Use GPT-5 mini when you need strong performance at lower cost, particularly for high-volume production workloads.

  • Does GPT-5 support multimodal input?

    Yes. It accepts text and image inputs, enabling vision-based analysis, document processing with figures, and mixed-modality workflows.

  • How does AI Gateway handle authentication for GPT-5?

    AI Gateway accepts a single API key or OIDC token for all requests. You don't embed OpenAI credentials in your application; AI Gateway routes and authenticates on your behalf.

  • What is the pricing for GPT-5?

    See the pricing section on this page for today's rates. AI Gateway exposes each provider's pricing for GPT-5.

  • What are typical latency characteristics?

    This page shows live throughput and time-to-first-token metrics measured across real AI Gateway traffic.