Gemma 4 31B IT

Gemma 4 31B IT is Google's open-weight dense model with 31B parameters, all active during inference. Built on the Gemini 3 architecture, it targets higher output quality than its MoE sibling, with support for function-calling, structured JSON output, native vision, and 140+ languages.

Tool Use, Vision (Image), File Input
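index.ts
import { streamText } from 'ai'

const result = streamText({
  model: 'google/gemma-4-31b-it', // model ID as listed on AI Gateway
  prompt: 'Why is the sky blue?',
})
// Print the response text as it streams in.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}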

Frequently Asked Questions

  • What makes Gemma 4 31B IT different from the MoE Gemma 4 26B?

    Gemma 4 31B IT is a dense model, meaning all 31B parameters are active during every forward pass. The MoE Gemma 4 26B activates roughly 4B of its 26B total parameters per pass. Gemma 4 31B IT targets higher output quality; the 26B variant targets lower latency and cost.
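As a rough illustration of that gap, the arithmetic below compares active parameters per forward pass using only the figures quoted above (31B dense; roughly 4B of 26B for the MoE variant). The type and function names are made up for this sketch, and parameter counts are only a coarse proxy for latency and cost.

```typescript
// Hypothetical helper type for this sketch; figures are the approximate ones quoted above.
interface ModelSize {
  name: string
  totalParamsB: number  // total parameters, in billions
  activeParamsB: number // parameters used per forward pass, in billions
}

const dense: ModelSize = { name: 'Gemma 4 31B IT', totalParamsB: 31, activeParamsB: 31 }
const moe: ModelSize = { name: 'Gemma 4 26B (MoE)', totalParamsB: 26, activeParamsB: 4 }

// Fraction of a model's weights that participate in each forward pass.
function activeFraction(m: ModelSize): number {
  return m.activeParamsB / m.totalParamsB
}

// How many times more parameters the dense model activates per pass.
const activeRatio = dense.activeParamsB / moe.activeParamsB

console.log(activeFraction(dense))          // 1
console.log(activeFraction(moe).toFixed(2)) // 0.15
console.log(activeRatio)                    // 7.75
```

In other words, the dense model runs every weight on every pass, while the MoE variant touches about 15% of its weights, which is where its latency and cost advantage comes from.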

  • What input modalities does Gemma 4 31B IT support?

    Gemma 4 31B IT accepts text and image inputs within a context window of 262.1K tokens. It does not generate images or audio.
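A minimal sketch of what a combined text-and-image request might look like. The local types below mirror the text and image content parts used by the AI SDK, but they are redeclared here as an assumption so the snippet runs on its own; `buildImageQuestion` and the example URL are hypothetical.

```typescript
// Local types mirroring the AI SDK's text/image content parts (assumed shape for this sketch).
type TextPart = { type: 'text'; text: string }
type ImagePart = { type: 'image'; image: URL | Uint8Array | string }
type UserMessage = { role: 'user'; content: Array<TextPart | ImagePart> }

// Hypothetical helper: pair a question with a single image.
function buildImageQuestion(question: string, image: URL | Uint8Array | string): UserMessage {
  return {
    role: 'user',
    content: [
      { type: 'text', text: question },
      { type: 'image', image },
    ],
  }
}

const message = buildImageQuestion(
  'What is shown in this picture?',
  new URL('https://example.com/photo.png'), // placeholder URL
)

// With the AI SDK installed, this would be sent roughly as:
//   streamText({ model: 'google/gemma-4-31b-it', messages: [message] })
console.log(JSON.stringify(message, null, 2))
```

Images count against the same context window as text, so large or numerous images leave less room for the prompt and the response.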

  • How does Gemma 4 31B IT relate to Google's Gemini models?

    Gemma 4 31B IT is built on the same architecture as Gemini 3 but with open weights. It shares capabilities like function-calling, structured output, and system instructions. Gemini models remain proprietary; Gemma 4 31B IT lets you inspect or adapt the weights.

  • What languages does Gemma 4 31B IT support?

    It supports over 140 languages. Instruction tuning covers multilingual conversational and task-oriented use cases.

  • How do I use Gemma 4 31B IT on AI Gateway?

    Set the model to google/gemma-4-31b-it in the AI SDK. AI Gateway handles provider routing, retries, and failover automatically.

  • Does Gemma 4 31B IT support function-calling?

    Yes. Function-calling for agentic workflows, structured JSON output, and system instructions are all supported natively, capabilities it inherits from the Gemini 3 architecture.
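To make the function-calling flow concrete, here is a minimal sketch of the application side: the model is given tool descriptions, emits a tool call (a tool name plus JSON arguments), and the application runs the matching function and hands back the result. Everything named here (the `getWeather` tool, the `ToolCall` shape, `runToolCall`) is hypothetical and is not the AI SDK's or Gemma's actual wire format; with the AI SDK, tool definitions are instead passed through the `tools` option of `streamText` or `generateText`.

```typescript
// Hypothetical shapes for illustration; not the AI SDK's or the model's real wire format.
interface ToolDefinition {
  description: string
  // JSON Schema describing the arguments the model is expected to produce.
  parameters: { type: 'object'; properties: Record<string, { type: string }>; required: string[] }
  execute: (args: Record<string, unknown>) => unknown
}

interface ToolCall {
  toolName: string
  args: Record<string, unknown>
}

const tools: Record<string, ToolDefinition> = {
  getWeather: {
    description: 'Get the current temperature for a city',
    parameters: {
      type: 'object',
      properties: { city: { type: 'string' } },
      required: ['city'],
    },
    // Stubbed result; a real tool would call a weather service here.
    execute: (args) => ({ city: args.city, temperatureC: 21 }),
  },
}

// Validate a model-emitted tool call against the registry, then run it.
function runToolCall(call: ToolCall): unknown {
  const tool = tools[call.toolName]
  if (!tool) throw new Error(`Unknown tool: ${call.toolName}`)
  for (const key of tool.parameters.required) {
    if (!(key in call.args)) throw new Error(`Missing argument: ${key}`)
  }
  return tool.execute(call.args)
}

// Simulated tool call, as the model might emit it after seeing the tool description.
const result = runToolCall({ toolName: 'getWeather', args: { city: 'Paris' } })
console.log(JSON.stringify(result)) // {"city":"Paris","temperatureC":21}
```

Checking the tool name and required arguments before executing is a deliberate choice: model-generated arguments are untrusted input, so the application should reject malformed calls instead of running them.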