
Gemini 3.1 Flash Lite

Gemini 3.1 Flash Lite is the GA release of the efficiency tier in the Gemini 3.1 generation. It improves on 2.5 Flash Lite in reasoning, multimodal understanding, agentic tool use, and long-context performance, and adds four configurable thinking levels and a 1M-token context window.

Reasoning · Tool Use · Implicit Caching · File Input · Vision (Image) · Web Search
index.ts
import { streamText } from 'ai'

const result = streamText({
  model: 'google/gemini-3.1-flash-lite',
  prompt: 'Why is the sky blue?',
})

// Stream the response text as it arrives.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}

Frequently Asked Questions

  • How is Gemini 3.1 Flash Lite different from google/gemini-3.1-flash-lite-preview?

    Gemini 3.1 Flash Lite is the general-availability release of the same efficiency tier in the Gemini 3.1 family. The preview entry remains in the catalog for teams pinned to the earlier identifier; the GA model is the recommended target for new production work.

  • How does Gemini 3.1 Flash Lite compare to Gemini 2.5 Flash Lite?

    Gemini 3.1 Flash Lite outperforms 2.5 Flash Lite on overall quality and lands close to 2.5 Flash across reasoning, multimodal understanding, agentic tool use, and long-context performance. For teams already running 2.5 Flash Lite at scale, it's a quality upgrade within the same lite tier.

  • What thinking levels does Gemini 3.1 Flash Lite support and how do they affect cost?

    Four levels: minimal, low, medium, and high. Higher levels add reasoning compute that contributes to output token counts, so the choice trades off latency and per-request cost against quality on harder inputs.

  • Can I mix thinking levels across requests in the same application?

    Yes. Set thinkingLevel per request in providerOptions.google.thinkingConfig. Routine requests can run at minimal while flagged hard cases run at medium or high without any architectural changes.
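    A minimal sketch of what per-request routing can look like. The helper name `googleOptions` and the difficulty flag are illustrative, not part of the SDK; only the `google.thinkingConfig.thinkingLevel` shape comes from the FAQ above.

    ```typescript
    // The four documented thinking levels for Gemini 3.1 Flash Lite.
    type ThinkingLevel = 'minimal' | 'low' | 'medium' | 'high'

    // Hypothetical helper: build the providerOptions value for one request.
    function googleOptions(level: ThinkingLevel) {
      return { google: { thinkingConfig: { thinkingLevel: level } } }
    }

    // Routine traffic stays cheap; flagged hard cases get more reasoning compute.
    const routine = googleOptions('minimal')
    const hard = googleOptions('high')

    console.log(routine.google.thinkingConfig.thinkingLevel) // minimal
    console.log(hard.google.thinkingConfig.thinkingLevel)    // high
    ```

    The returned object is what you would pass as `providerOptions` on a `streamText` or `generateText` call, so switching levels is a per-call decision rather than a deployment change.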

  • Does Gemini 3.1 Flash Lite support multimodal inputs?

    Yes. Gemini 3.1 Flash Lite accepts text, images, audio, and documents as input within the 1M-token context window and returns text output. Web search and implicit caching are available as runtime options.
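    A sketch of a mixed text-and-image request using the AI SDK's multi-part message shape. The image URL is a placeholder; the prompt text is illustrative.

    ```typescript
    // One user message with two content parts: a text instruction and an image.
    const messages = [
      {
        role: 'user' as const,
        content: [
          { type: 'text' as const, text: 'Describe this image in one sentence.' },
          // Placeholder URL; a Buffer or base64 string also works here.
          { type: 'image' as const, image: new URL('https://example.com/photo.jpg') },
        ],
      },
    ]

    console.log(messages[0].content.length) // 2
    ```

    The `messages` array would be passed to `streamText({ model: 'google/gemini-3.1-flash-lite', messages })` in place of the simple `prompt` string shown at the top of the page.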

  • How do I call Gemini 3.1 Flash Lite on AI Gateway?

    Use the identifier google/gemini-3.1-flash-lite with the AI SDK, the OpenAI-compatible Chat Completions endpoint, the Responses API, or any other supported interface. AI Gateway handles provider routing, retries, and failover automatically.
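    A sketch of the same call through the OpenAI-compatible Chat Completions endpoint with plain `fetch`, assuming the standard AI Gateway base URL and an `AI_GATEWAY_API_KEY` environment variable; adjust both for your setup.

    ```typescript
    // Request body in OpenAI Chat Completions format; only the model
    // identifier changes versus calling OpenAI directly.
    const body = {
      model: 'google/gemini-3.1-flash-lite',
      messages: [{ role: 'user', content: 'Why is the sky blue?' }],
    }

    async function callGateway(): Promise<string> {
      const res = await fetch('https://ai-gateway.vercel.sh/v1/chat/completions', {
        method: 'POST',
        headers: {
          Authorization: `Bearer ${process.env.AI_GATEWAY_API_KEY}`,
          'Content-Type': 'application/json',
        },
        body: JSON.stringify(body),
      })
      const json = await res.json()
      return json.choices[0].message.content
    }

    // Only hit the network when a key is configured.
    if (process.env.AI_GATEWAY_API_KEY) {
      callGateway().then(console.log)
    }
    ```

    Because the payload is standard Chat Completions JSON, existing OpenAI client code can usually be pointed at the gateway by swapping the base URL and model identifier.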

  • How does Zero Data Retention work with Gemini 3.1 Flash Lite through AI Gateway?

    Zero Data Retention is available for this model. Because ZDR is offered on a per-provider basis, availability depends on the provider serving the request. See https://vercel.com/docs/ai-gateway/capabilities/zdr for details.