Skip to content

Grok 3 Mini Fast Beta

Grok 3 Mini Fast Beta is the fastest and most cost-efficient model in xAI's Grok 3 family. It combines the compact Grok 3 Mini architecture with speed optimization for maximum throughput at the lowest per-token cost.

Tool Use
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'xai/grok-3-mini-fast',
prompt: 'Why is the sky blue?'
})

Frequently Asked Questions

  • What is Grok 3 Mini Fast Beta optimized for?

    Maximum inference speed and minimum cost per token. It combines the compact Grok 3 Mini architecture with additional speed optimization for the highest throughput in the Grok 3 family.

  • How does Grok 3 Mini Fast Beta compare to Grok 3 Mini?

    Grok 3 Mini Fast Beta adds latency optimization on top of the Grok 3 Mini architecture. It's faster but may trade marginal quality on complex reasoning tasks.

  • What is the context window?

    131.1K tokens.

  • What does Grok 3 Mini Fast Beta cost?

    Rates are listed on this page. They reflect the providers routing through AI Gateway and shift when providers update their pricing.

  • How do I authenticate with Grok 3 Mini Fast Beta through Vercel AI Gateway?

    Use your Vercel AI Gateway API key with xai/grok-3-mini-fast as the model identifier. No separate xAI account is needed for gateway-managed access.

  • Can Grok 3 Mini Fast Beta handle code generation?

    It handles basic code generation, boilerplate, and simple transformations. For complex coding tasks, Grok 3 Fast or Grok Code Fast 1 are better suited.

  • Does Vercel AI Gateway support Zero Data Retention for Grok 3 Mini Fast Beta?

    Zero Data Retention is not currently available for this model. ZDR on AI Gateway applies to direct gateway requests; BYOK flows aren't covered. See https://vercel.com/docs/ai-gateway/capabilities/zdr for details.