Overview Deployments Analytics Speed Insights Logs Observability Firewall AI Gateway Storage Flags Settings

Gemini 2.5 Flash

google/gemini-2.5-flash

Overview

Playground

Documentation

About

Gemini 2.5 Flash is a thinking model that offers great, well-rounded capabilities. It is designed to offer a balance between price and performance with multimodal support and a 1M token context window.

import { streamText } from 'ai'

const result = streamText({
  model: 'google/gemini-2.5-flash',
  prompt: 'What is the history of the San Francisco Mission-style burrito?'
})

Capabilities

file-inputreasoningtool-usevision

Playground

Try out Gemini 2.5 Flash by Google. Usage is billed to your team at API rates. Free users get $5 of credits every 30 days, and you are considered a free user if you haven't made a payment.

Providers

The AI Gateway supports routing requests across multiple AI providers. You can control provider preferences using the provider slugs available for copying with the buttons below. For more see the AI Gateway provider options documentation.

Google Vertex

Context 1M

Input Tokens $0.30/M

Output Tokens $2.50/M

Cache Read Tokens $0.03/M

Web Search Calls $35.00/K

Terms Privacy

Google

Context

More models by Google

Gemini 2.0 Flash

Gemini 2.0 Flash delivers next-gen features and improved capabilities, including superior speed, built-in tool use, multimodal generation, and a 1M token context window.

Gemini 2.0 Flash Lite

Gemini 2.0 Flash delivers next-gen features and improved capabilities, including superior speed, built-in tool use, multimodal generation, and a 1M token context window.

Nano Banana (Gemini 2.5 Flash Image)

Nano Banana (Gemini 2.5 Flash Image) is Google's first fully hybrid reasoning model, letting developers turn thinking on or off and set thinking budgets to balance quality, cost, and latency. Upgraded for rapid creative workflows, it can generate interleaved text and images and supports conversational, multi‑turn image editing in natural language. It’s also locale‑aware, enabling culturally and linguistically appropriate image generation for audiences worldwide.