Gemini 2.5 Flash is a thinking model that offers great, well-rounded capabilities. It is designed to offer a balance between price and performance with multimodal support and a 1M token context window.
import { streamText } from 'ai'
const result = streamText({ model: 'google/gemini-2.5-flash', prompt: 'What is the history of the San Francisco Mission-style burrito?'})Try out Gemini 2.5 Flash by Google. Usage is billed to your team at API rates. Free users get $5 of credits every 30 days, and you are considered a free user if you haven't made a payment.
Google Vertex
Context 1M
Input Tokens $0.30/M
Output Tokens $2.50/M
Cache Read Tokens $0.03/M
Web Search Calls $35.00/K
Context
Input Tokens $0.30/M
Output Tokens $2.50/M
Cache Read Tokens $0.03/M
Web Search Calls $35.00/K
Nano Banana Preview (Gemini 2.5 Flash Image Preview)
Gemini 2.5 Flash Image Preview is Google's first fully hybrid reasoning model, letting developers turn thinking on or off and set thinking budgets to balance quality, cost, and latency. Upgraded for rapid creative workflows, it can generate interleaved text and images and supports conversational, multi‑turn image editing in natural language. It’s also locale‑aware, enabling culturally and linguistically appropriate image generation for audiences worldwide.