Gemini 3.1 Flash Lite is now on AI Gateway

Gemini 3.1 Flash Lite from Google is now available on AI Gateway.

This model outperforms 2.5 Flash Lite on overall quality, with notable improvements in translation, data extraction, and code completion. Gemini 3.1 Flash Lite is best suited for high-volume agentic tasks, data extraction, and applications where budget and latency are the primary evaluation constraints.

To use this model, set model to google/gemini-3.1-flash-lite-preview in the AI SDK. This model supports four thinking levels, minimal, low, medium, and high.

import { streamText } from 'ai';
const result = streamText({
  model: 'google/gemini-3.1-flash-lite-preview',
  prompt:
    `Translate this customer support article from English to Japanese,
     preserving formatting and technical terms.`,
  providerOptions: {
    google: {
      thinkingConfig: {
        thinkingLevel: 'medium',
        includeThoughts: true,
      },
    },
  },
});

AI Gateway provides a unified API for calling models, tracking usage and cost, and configuring retries, failover, and performance optimizations for higher-than-provider uptime. It includes built-in observability, Bring Your Own Key support, and intelligent provider routing with automatic retries.

Learn more about AI Gateway, view the AI Gateway model leaderboard or try it in our model playground.

Agent Stack

Core Platform

Tools

Learn

Build

Explore

Gemini 3.1 Flash Lite is now on AI Gateway

Ready to deploy?