Gemini 2.0 Flash Lite

Gemini 2.0 Flash Lite is the lowest-cost multimodal model in Google's 2.0 lineup. It accepts text, images, audio, and documents within a 1M-token context window and produces text output only. It is designed for budget-first workloads where output volume drives infrastructure cost.

Capabilities: File Input, Tool Use, Vision (Image), Web Search

index.ts

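import { streamText } from 'ai'

const result = streamText({
  model: 'google/gemini-2.0-flash-lite',
  prompt: 'Why is the sky blue?'
})

// Print the response as it streams in.
for await (const textPart of result.textStream) {
  process.stdout.write(textPart)
}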
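
Because the model accepts images as well as text, a request can carry both in a single user message. The sketch below only builds the message payload, assuming the AI SDK's content-part shape (`type: 'text'` and `type: 'image'`). The `buildImageQuestion` helper, the local types, and the example URL are hypothetical and do not come from this page.

```typescript
// Local stand-ins for the AI SDK's user-message content parts (assumed shape).
type TextPart = { type: 'text'; text: string }
type ImagePart = { type: 'image'; image: URL }
type UserMessage = { role: 'user'; content: Array<TextPart | ImagePart> }

// Hypothetical helper: pair a question with an image in one user message.
function buildImageQuestion(question: string, imageUrl: string): UserMessage {
  return {
    role: 'user',
    content: [
      { type: 'text', text: question },
      { type: 'image', image: new URL(imageUrl) }
    ]
  }
}

const message = buildImageQuestion(
  'What is shown in this picture?',
  'https://example.com/sky.jpg' // placeholder URL
)

// List the part types that make up the message.
console.log(message.content.map((part) => part.type).join(', '))
```

Under these assumptions, the message would be passed as `messages: [message]` in place of `prompt` in the `streamText` call.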


More models by Google

Model	Latency	Throughput	Input	Output	Cache Read	Cache Write	Web Search	Per Query	Release Date
google/gemini-3.5-flash	2.4s	279tps	$1.50/M	$9.00/M	$0.15/M	—	$14.00/K + input costs	—	05/19/2026
google/gemini-3.1-flash-lite-preview	0.6s	244tps	$0.25/M	$1.50/M	$0.03/M	—	$14.00/K + input costs	—	03/03/2026
google/gemini-3.1-pro-preview	5.9s	290tps	$2.00/M	$12.00/M	$0.20/M	—	$14.00/K + input costs	—	02/19/2026
google/gemini-3-flash	0.8s	167tps	$0.50/M	$3.00/M	$0.05/M	—	$14.00/K + input costs	—	12/17/2025
google/gemini-2.5-flash-lite	0.3s	243tps	$0.10/M	$0.40/M	$0.01/M	—	$35.00/K + input costs	—	06/17/2025
google/gemini-2.5-pro	1.9s	126tps	$1.25/M	$10.00/M	$0.13/M	—	$35.00/K + input costs	—	03/20/2025
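
The Input and Output columns are prices per million tokens, so a rough per-request cost follows from simple arithmetic. The sketch below uses the google/gemini-2.5-flash-lite prices from the table. The `estimateCostUsd` helper and the token counts are illustrative assumptions, and cache and web search charges are ignored.

```typescript
// Prices in USD per million tokens, as listed in the table.
type Pricing = { inputPerM: number; outputPerM: number }

const gemini25FlashLite: Pricing = { inputPerM: 0.10, outputPerM: 0.40 }

// Hypothetical helper: estimate the cost of one request in USD.
function estimateCostUsd(
  pricing: Pricing,
  inputTokens: number,
  outputTokens: number
): number {
  const inputCost = (inputTokens / 1000000) * pricing.inputPerM
  const outputCost = (outputTokens / 1000000) * pricing.outputPerM
  return inputCost + outputCost
}

// Example: 200,000 input tokens and 50,000 output tokens (made-up figures).
const cost = estimateCostUsd(gemini25FlashLite, 200000, 50000)
console.log(cost.toFixed(4)) // prints 0.0400
```

At these prices, each output token costs four times as much as an input token, so response length has the larger effect on cost per token.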
