Skip to content

GPT 4o Mini Search Preview

GPT 4o Mini Search Preview combines GPT-4o mini's cost efficiency with built-in web search, grounding responses in real-time information from the web without requiring external search tool integration.

Web Search
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'openai/gpt-4o-mini-search-preview',
prompt: 'Why is the sky blue?'
})

Frequently Asked Questions

  • How does the built-in search differ from using a separate search API?

    The model decides what to search for, when to search, and how to synthesize results as part of its reasoning flow. This produces more naturally integrated answers compared to prepending search results to a prompt.

  • Does GPT 4o Mini Search Preview always search the web?

    No. The model determines whether a web search would improve its response. For questions answerable from training data alone, it may skip the search step.

  • What is the pricing for GPT 4o Mini Search Preview?

    Current pricing is shown on this page. AI Gateway routes across providers, and rates may vary by provider.

  • Can I use GPT 4o Mini Search Preview for real-time customer support?

    Yes. It can ground answers in current documentation, pricing pages, and product information from the web, making support responses more accurate and up-to-date.

  • How does AI Gateway handle authentication for GPT 4o Mini Search Preview?

    AI Gateway accepts a single API key or OIDC token for all requests. You don't embed OpenAI credentials in your application; AI Gateway routes and authenticates on your behalf.

  • What context window does GPT 4o Mini Search Preview support?

    GPT 4o Mini Search Preview supports a context window of 128K tokens, consistent with the GPT-4o mini family.

  • What are typical latency characteristics?

    This page shows live throughput and time-to-first-token metrics measured across real AI Gateway traffic. Search-augmented responses may take slightly longer due to web retrieval.