Question 1

How does the built-in search differ from using a separate search API?

Accepted Answer

The model decides what to search for, when to search, and how to synthesize results as part of its reasoning flow. This produces more naturally integrated answers compared to prepending search results to a prompt.

Question 2

Does GPT 4o Mini Search Preview always search the web?

Accepted Answer

No. The model determines whether a web search would improve its response. For questions answerable from training data alone, it may skip the search step.

Question 3

What is the pricing for GPT 4o Mini Search Preview?

Accepted Answer

Current pricing is shown on this page. AI Gateway routes across providers, and rates may vary by provider.

Question 4

Can I use GPT 4o Mini Search Preview for real-time customer support?

Accepted Answer

Yes. It can ground answers in current documentation, pricing pages, and product information from the web, making support responses more accurate and up-to-date.

Question 5

How does AI Gateway handle authentication for GPT 4o Mini Search Preview?

Accepted Answer

AI Gateway accepts a single API key or OIDC token for all requests. You don't embed OpenAI credentials in your application; AI Gateway routes and authenticates on your behalf.

Question 6

What context window does GPT 4o Mini Search Preview support?

Accepted Answer

GPT 4o Mini Search Preview supports a context window of 128K tokens, consistent with the GPT-4o mini family.

Question 7

What are typical latency characteristics?

Accepted Answer

This page shows live throughput and time-to-first-token metrics measured across real AI Gateway traffic. Search-augmented responses may take slightly longer due to web retrieval.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

GPT 4o Mini Search Preview

Frequently Asked Questions