Gemini 2.5 Flash Preview 09-2025
Gemini 2.5 Flash Preview 09-2025 is Google's September 2025 preview of the next Gemini 2.5 Flash, scoring 54% on SWE-Bench Verified (up from 48.9%), improving agentic tool use, and reducing thinking-mode output tokens compared to the stable release.
import { streamText } from 'ai'
const result = streamText({ model: 'google/gemini-2.5-flash-preview-09-2025', prompt: 'Why is the sky blue?'})Playground
Try out Gemini 2.5 Flash Preview 09-2025 by Google. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.
Ask Gemini 2.5 Flash Preview 09-2025 anything to try it out.
Providers
Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.
| Provider |
|---|
P50 throughput on live AI Gateway traffic, in tokens per second (TPS). Visit the docs for more info.
P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. View the docs for more info.
Direct request success rate on AI Gateway and per-provider. Visit the docs for more info.
More models by Google
| Model |
|---|
About Gemini 2.5 Flash Preview 09-2025
Gemini 2.5 Flash Preview 09-2025 is a preview release from Google dated September 25, 2025. It advances the hybrid reasoning model that defined the 2.5 Flash tier. Two improvements stand out.
First, agentic tool use. Google called out better performance on complex, multi-step applications. The SWE-Bench Verified score moved from 48.9% to 54%, a five-point gain on real-world software engineering tasks involving bug fixes and feature implementations across production codebases.
Second, efficiency with thinking enabled. The preview produces fewer output tokens compared to the stable model when thinking mode is active. Fewer thinking tokens means lower cost and faster responses on reasoning-intensive prompts while maintaining quality.
Like the Flash Lite preview released alongside it, this model uses Google's -latest alias system with two-week deprecation notices. Pin to gemini-2.5-flash-preview-09-2025 for stable evaluation. Google intends the preview for feedback collection, not as a direct replacement for the stable 2.5 Flash.
What To Consider When Choosing a Provider
- Configuration: This is a preview model. Reduced thinking-token usage shifts cost profiles for reasoning-heavy workloads. Benchmark against the stable 2.5 Flash using AI Gateway's observability tools before committing production traffic.
- Zero Data Retention: AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.
- Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
When to Use Gemini 2.5 Flash Preview 09-2025
Best For
- Agentic coding pipelines: The 54% SWE-Bench Verified score represents a concrete improvement over the stable 2.5 Flash
- Multi-step tool use applications: Benefit from improved agentic capabilities across complex workflows
- Reasoning-intensive workloads with cost constraints: Lower thinking-token usage reduces spend
- Hybrid reasoning applications: Toggle thinking on and off per request and need better cost efficiency when thinking is active
- Software engineering automation: Bug triage, code review, and feature implementation
Consider Alternatives When
- Production stability required: Use the stable Gemini 2.5 Flash for production workloads
- Simple classification or extraction: Thinking adds cost without benefit, and Gemini 2.5 Flash Lite is cheaper
- Deepest reasoning needed: Gemini 2.5 Pro targets the hardest problems with no cost constraint
Conclusion
Gemini 2.5 Flash Preview 09-2025 delivers two concrete gains over the stable 2.5 Flash: stronger agentic tool use (54% SWE-Bench Verified) and cheaper thinking (fewer output tokens in thinking mode). For teams already using 2.5 Flash on reasoning or coding tasks, this preview is worth benchmarking in staging.