Gemini 3.5 Flash

Gemini 3.5 Flash advances the Flash line with improved coding proficiency, parallel agentic execution, stronger core reasoning, tighter instruction following, and higher-quality reasoning traces in thinking mode.

ReasoningFile InputVision (Image)Tool UseWeb SearchImplicit Caching

index.ts

import { streamText } from 'ai'

const result = streamText({
  model: 'google/gemini-3.5-flash',
  prompt: 'Why is the sky blue?'
})

Overview About Providers Throughput Latency Uptime Status Similar FAQ

About Gemini 3.5 Flash

Gemini 3.5 Flash is Google's update to the Flash tier, building on Gemini 3 Flash with focused improvements for coding workflows and agentic execution. Coding proficiency and parallel agentic execution loops both improve over previous Flash versions, which makes Gemini 3.5 Flash a better fit for agents that issue concurrent tool calls or refactor code across multiple files in one pass.

Core reasoning, instruction following, and multi-turn coherence all see upgrades. For complex tasks the model produces higher-quality reasoning traces in thinking mode, which is useful when you need to audit the model's intermediate steps or train downstream systems on chain-of-thought data. Gemini 3.5 Flash defaults to the medium thinking level, balancing quality against faster, more cost-efficient generation, and exposes thinkingLevel and includeThoughts through providerOptions for finer control.

Because Gemini 3.5 Flash sits at the intersection of agentic capability and Flash-tier throughput, it suits production traffic patterns from low-latency chat interfaces to high-volume code-transformation pipelines. Accessing Gemini 3.5 Flash through AI Gateway adds observability, automatic retries, and provider failover without requiring a Google Cloud account.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Gemini 3.5 Flash

About Gemini 3.5 Flash