Skip to content

Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is the efficiency-focused model in the Gemini 3.1 generation for budget-constrained, high-volume workloads, with notable gains in translation, data extraction, and code completion over Gemini 2.5 Flash Lite and four configurable thinking levels.

ReasoningTool UseImplicit CachingFile InputVision (Image)Web Search
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'google/gemini-3.1-flash-lite-preview',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
Google
Legal:Terms
Privacy
1M
1.0s
245tps
$0.25/M$1.50/M
Read:$0.03/M
Write:
$14.00/K
+ input costs
03/03/2026
Google Vertex
Legal:Terms
Privacy
1M
0.9s
212tps
$0.25/M$1.50/M
Read:$0.03/M
Write:
$14.00/K
+ input costs
03/03/2026