Skip to content

Gemma 4 31B IT

Gemma 4 31B IT is Google's open-weight dense model with 31B parameters, all active during inference. Built on the Gemini 3 architecture, it targets higher output quality than its MoE sibling, with support for function-calling, structured JSON output, native vision, and 140+ languages.

Tool UseVision (Image)File Input
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'google/gemma-4-31b-it',
prompt: 'Why is the sky blue?'
})

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
Novita AI
262K
0.9s
22tps
$0.14/M$0.40/M
+1
04/02/2026
Parasail
262K
0.6s
111tps
$0.14/M$0.40/M
04/02/2026