Gemma 4 31B IT
Gemma 4 31B IT is Google's open-weight dense model with 31B parameters, all active during inference. Built on the Gemini 3 architecture, it targets higher output quality than its MoE sibling, with support for function-calling, structured JSON output, native vision, and 140+ languages.
import { streamText } from 'ai'
const result = streamText({ model: 'google/gemma-4-31b-it', prompt: 'Why is the sky blue?'})Frequently Asked Questions
What makes Gemma 4 31B IT different from the MoE Gemma 4 26B?
Gemma 4 31B IT is a dense model, meaning all 31B parameters are active during every forward pass. The MoE Gemma 4 26B activates roughly 4B of its 26B total parameters per pass. Gemma 4 31B IT targets higher output quality; the 26B variant targets lower latency and cost.
What input modalities does Gemma 4 31B IT support?
Gemma 4 31B IT accepts text and image inputs within a context window of 262.1K tokens. It does not generate images or audio.
How does Gemma 4 31B IT relate to Google's Gemini models?
Gemma 4 31B IT is built on the same architecture as Gemini 3 but with open weights. It shares capabilities like function-calling, structured output, and system instructions. Gemini models remain proprietary; Gemma 4 31B IT lets you inspect or adapt the weights.
What languages does Gemma 4 31B IT support?
Over 140 languages. The instruction-tuning covers multilingual conversational and task-oriented use cases.
How do I use Gemma 4 31B IT on AI Gateway?
Set the model to
google/gemma-4-31b-itin the AI SDK. AI Gateway handles provider routing, retries, and failover automatically.Does Gemma 4 31B IT support function-calling?
Yes. It supports function-calling for agentic workflows, structured JSON output, and system instructions natively, inherited from the Gemini 3 architecture.