Skip to content

Grok 3 Mini Fast Beta

Grok 3 Mini Fast Beta is the fastest and most cost-efficient model in xAI's Grok 3 family. It combines the compact Grok 3 Mini architecture with speed optimization for maximum throughput at the lowest per-token cost.

Tool Use
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'xai/grok-3-mini-fast',
prompt: 'Why is the sky blue?'
})

More models by xAI

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
1M
0.9s
183tps
$1.25/M
$2.50/M
Read:
$0.2/M
Write:
$5/K
+ input costs
xai logo
04/30/2026
2M
0.6s
125tps
$1.25/M
$2.50/M
Read:
$0.2/M
Write:
$5/K
+ input costs
vertex logo
xai logo
03/09/2026
2M
4.3s
904tps
$1.25/M
$2.50/M
Read:
$0.2/M
Write:
$5/K
+ input costs
xai logo
03/09/2026
2M
0.4s
48tps
$1.25/M
$2.50/M
Read:
$0.2/M
Write:
$5/K
+ input costs
vertex logo
xai logo
03/09/2026
1M
0.2s
85tps
$0.20/M$0.50/M
Read:$0.05/M
Write:
vertex logo
07/09/2025
1M
0.7s
343tps
$0.20/M$0.50/M
Read:$0.05/M
Write:
vertex logo
07/09/2025