1 min read
Gemini 3.1 Flash Lite from Google is now available on AI Gateway.
This model outperforms 2.5 Flash Lite on overall quality, with notable improvements in translation, data extraction, and code completion. Gemini 3.1 Flash Lite is best suited for high-volume agentic tasks, data extraction, and applications where budget and latency are the primary evaluation constraints.
To use this model, set model to google/gemini-3.1-flash-lite-preview in the AI SDK. This model supports four thinking levels, minimal, low, medium, and high.
import { streamText } from 'ai';
const result = streamText({ model: 'google/gemini-3.1-flash-lite-preview', prompt: `Translate this customer support article from English to Japanese, preserving formatting and technical terms.`, providerOptions: { google: { thinkingConfig: { thinkingLevel: 'medium', includeThoughts: true, }, }, },});AI Gateway provides a unified API for calling models, tracking usage and cost, and configuring retries, failover, and performance optimizations for higher-than-provider uptime. It includes built-in observability, Bring Your Own Key support, and intelligent provider routing with automatic retries.
Learn more about AI Gateway, view the AI Gateway model leaderboard or try it in our model playground.