Grok 3 Mini Fast Beta
Grok 3 Mini Fast Beta is the fastest and most cost-efficient model in xAI's Grok 3 family. It combines the compact Grok 3 Mini architecture with speed optimization for maximum throughput at the lowest per-token cost.
import { streamText } from 'ai'
const result = streamText({ model: 'xai/grok-3-mini-fast', prompt: 'Why is the sky blue?'})Frequently Asked Questions
What is Grok 3 Mini Fast Beta optimized for?
Maximum inference speed and minimum cost per token. It combines the compact Grok 3 Mini architecture with additional speed optimization for the highest throughput in the Grok 3 family.
How does Grok 3 Mini Fast Beta compare to Grok 3 Mini?
Grok 3 Mini Fast Beta adds latency optimization on top of the Grok 3 Mini architecture. It's faster but may trade marginal quality on complex reasoning tasks.
What is the context window?
131.1K tokens.
What does Grok 3 Mini Fast Beta cost?
Rates are listed on this page. They reflect the providers routing through AI Gateway and shift when providers update their pricing.
How do I authenticate with Grok 3 Mini Fast Beta through Vercel AI Gateway?
Use your Vercel AI Gateway API key with
xai/grok-3-mini-fastas the model identifier. No separate xAI account is needed for gateway-managed access.Can Grok 3 Mini Fast Beta handle code generation?
It handles basic code generation, boilerplate, and simple transformations. For complex coding tasks, Grok 3 Fast or Grok Code Fast 1 are better suited.
Does Vercel AI Gateway support Zero Data Retention for Grok 3 Mini Fast Beta?
Zero Data Retention is not currently available for this model. ZDR on AI Gateway applies to direct gateway requests; BYOK flows aren't covered. See https://vercel.com/docs/ai-gateway/capabilities/zdr for details.