Skip to content
Vercel April 2026 security incident

GPT-5

openai/gpt-5

GPT-5 is the standard tier of the GPT-5 model family, unifying advanced reasoning, coding, and multimodal capabilities in a single architecture that surpasses its predecessors across benchmarks while maintaining broad general-purpose utility.

File InputReasoningTool UseVision (Image) Image GenImplicit Caching
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'openai/gpt-5',
prompt: 'Why is the sky blue?'
})

What To Consider When Choosing a Provider

  • Zero Data Retention

    AI Gateway supports Zero Data Retention for this model via direct gateway requests (BYOK is not included). To configure this, check the documentation.

    Authentication

    AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

GPT-5 provides the full capability of the GPT-5 architecture. It's the right choice when you need high overall quality across reasoning, coding, and creative tasks and cost is secondary to capability.

For cost-sensitive deployments, consider GPT-5 mini or GPT-5 nano, which deliver much of the same architectural improvement at lower price points.

When to Use GPT-5

Best For

  • Complex multi-step reasoning:

    Research analysis, strategic planning, and problems requiring extended deliberation

  • Advanced code generation:

    Full-repository comprehension, architectural decisions, and complex refactoring

  • Multimodal workflows:

    Processing images, documents, and text together in sophisticated analysis pipelines

  • High-stakes content generation:

    Legal drafting, technical documentation, and precision-critical writing

  • Agentic systems:

    Backbone model for autonomous agents that need the full capability of the GPT-5 family

Consider Alternatives When

  • Cost-sensitive workloads:

    GPT-5 mini offers strong capability at a lower price point

  • Lightweight tasks:

    GPT-5 nano or GPT-4.1 nano handle classification and routing more efficiently

  • Specialized reasoning:

    The o-series reasoning models may outperform on pure mathematical and scientific reasoning tasks

  • Speed-critical applications:

    Smaller models provide faster time-to-first-token for real-time chat

Conclusion

GPT-5 advanced reasoning, coding, and multimodal capability when the GPT-5 family launched. For applications routed through AI Gateway that need the full GPT-5 family capability, it is the standard tier.

FAQ

GPT-5 represents a generational improvement over GPT-4o across reasoning, coding, instruction following, and multimodal capabilities.

GPT-5 supports a context window of 400K tokens, enabling full-codebase analysis and extended document processing in a single request.

Use GPT-5 when you need the highest-tier capability in the GPT-5 family. Use GPT-5 mini when you need strong performance at lower cost, particularly for high-volume production workloads.

Yes. It accepts text and image inputs, enabling vision-based analysis, document processing with figures, and mixed-modality workflows.

AI Gateway accepts a single API key or OIDC token for all requests. You don't embed OpenAI credentials in your application; AI Gateway routes and authenticates on your behalf.

See the pricing section on this page for today's rates. AI Gateway exposes each provider's pricing for GPT-5.

This page shows live throughput and time-to-first-token metrics measured across real AI Gateway traffic.