Gemini 3.5 Flash
Gemini 3.5 Flash advances the Flash line with improved coding proficiency, parallel agentic execution, stronger core reasoning, tighter instruction following, and higher-quality reasoning traces in thinking mode.
import { streamText } from 'ai'

const result = streamText({
  model: 'google/gemini-3.5-flash',
  prompt: 'Why is the sky blue?',
})

About Gemini 3.5 Flash
Gemini 3.5 Flash is Google's update to the Flash tier, building on Gemini 3 Flash with improvements focused on coding workflows and agentic execution. Both coding proficiency and parallel agentic execution loops improve over previous Flash versions, making Gemini 3.5 Flash a better fit for agents that issue concurrent tool calls or refactor code across multiple files in one pass.
Core reasoning, instruction following, and multi-turn coherence are all upgraded. For complex tasks, the model produces higher-quality reasoning traces in thinking mode, which is useful when you need to audit the model's intermediate steps or train downstream systems on chain-of-thought data. Gemini 3.5 Flash defaults to the medium thinking level, which trades some quality for faster, more cost-efficient generation, and it exposes thinkingLevel and includeThoughts through providerOptions for finer control.
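As a minimal sketch of what setting these two options might look like, the helper below builds a providerOptions object. The helper name buildThinkingOptions, the nesting under google.thinkingConfig, and the level values other than medium are assumptions for illustration, not confirmed by this page; check the provider documentation for the exact shape before relying on it.

```typescript
// Assumed set of thinking levels; this page only confirms 'medium' as the default.
type ThinkingLevel = 'low' | 'medium' | 'high';

// Assumed shape: options nested under a `google` key inside `thinkingConfig`.
interface ThinkingProviderOptions {
  google: {
    thinkingConfig: {
      thinkingLevel: ThinkingLevel;
      includeThoughts: boolean;
    };
  };
}

// Hypothetical helper that builds the providerOptions value.
// Defaults mirror the documented default level (medium) and leave thoughts hidden.
function buildThinkingOptions(
  level: ThinkingLevel = 'medium',
  includeThoughts: boolean = false,
): ThinkingProviderOptions {
  return {
    google: {
      thinkingConfig: { thinkingLevel: level, includeThoughts },
    },
  };
}

// Example: request a higher thinking level and ask for reasoning traces,
// e.g. when auditing the model's intermediate steps.
const auditOptions = buildThinkingOptions('high', true);
console.log(JSON.stringify(auditOptions, null, 2));
```

Under these assumptions, the resulting object would be passed as the providerOptions field of the streamText call, alongside model and prompt.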
Because Gemini 3.5 Flash sits at the intersection of agentic capability and Flash-tier throughput, it suits production traffic patterns from low-latency chat interfaces to high-volume code-transformation pipelines. Accessing Gemini 3.5 Flash through AI Gateway adds observability, automatic retries, and provider failover without requiring a Google Cloud account.