Grok 3 Mini Fast Beta

Grok 3 Mini Fast Beta is the fastest and most cost-efficient model in xAI's Grok 3 family. It combines the compact Grok 3 Mini architecture with speed optimization for maximum throughput at the lowest per-token cost.


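import { streamText } from 'ai'

// Start streaming a completion from the model.
const result = streamText({
  model: 'xai/grok-3-mini-fast',
  prompt: 'Why is the sky blue?'
})

// Print the response text as chunks arrive.
for await (const chunk of result.textStream) {
  process.stdout.write(chunk)
}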


Frequently Asked Questions

What is Grok 3 Mini Fast Beta optimized for?
Maximum inference speed and minimum cost per token. It combines the compact Grok 3 Mini architecture with additional speed optimization for the highest throughput in the Grok 3 family.
How does Grok 3 Mini Fast Beta compare to Grok 3 Mini?
Grok 3 Mini Fast Beta adds latency optimization on top of the Grok 3 Mini architecture. It's faster, but may give up a small amount of quality on complex reasoning tasks.
What is the context window?
131.1K tokens.
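
As a rough illustration of budgeting against that limit, the sketch below checks whether a prompt is likely to fit. It rests on two assumptions that this page does not state: that 131.1K is a rounding of 131,072 tokens, and that a token averages about four characters of English text. The helper names are hypothetical, and a real tokenizer would give different counts.

```typescript
// ASSUMPTION: "131.1K tokens" is taken to mean 131,072; the exact limit is not stated on this page.
const CONTEXT_WINDOW_TOKENS = 131_072;

// ASSUMPTION: crude heuristic of ~4 characters per token, not the model's real tokenizer.
const CHARS_PER_TOKEN = 4;

// Hypothetical helper: approximate token count from character length.
function estimateTokens(text: string): number {
  return Math.ceil(text.length / CHARS_PER_TOKEN);
}

// Hypothetical helper: true if the prompt plus a reserved output budget
// is estimated to stay within the context window.
function fitsContextWindow(prompt: string, reservedOutputTokens = 1024): boolean {
  return estimateTokens(prompt) + reservedOutputTokens <= CONTEXT_WINDOW_TOKENS;
}
```

Because the estimate is approximate, leaving generous headroom is safer than packing a prompt right up to the limit.
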
What does Grok 3 Mini Fast Beta cost?
Rates are listed on this page. They reflect the pricing of the providers that AI Gateway routes to, and they change when those providers update their pricing.
How do I authenticate with Grok 3 Mini Fast Beta through Vercel AI Gateway?
Use your Vercel AI Gateway API key with `xai/grok-3-mini-fast` as the model identifier. No separate xAI account is needed for gateway-managed access.
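
To make the authentication step concrete, here is a minimal sketch that builds (but does not send) a request carrying the gateway key as a bearer token. The endpoint URL, the OpenAI-style request body, and the helper name `buildGatewayRequest` are assumptions for illustration and are not taken from this page; only the model identifier is. Check the AI Gateway documentation before relying on them.

```typescript
// ASSUMPTION: the base URL and the chat-completions request shape are not stated on this page.
const GATEWAY_URL = 'https://ai-gateway.vercel.sh/v1/chat/completions';
const MODEL_ID = 'xai/grok-3-mini-fast'; // model identifier from this page

interface GatewayRequest {
  url: string;
  init: { method: string; headers: Record<string, string>; body: string };
}

// Hypothetical helper: assembles the URL and fetch options for one prompt.
function buildGatewayRequest(apiKey: string, prompt: string): GatewayRequest {
  if (!apiKey) {
    throw new Error('Missing AI Gateway API key');
  }
  return {
    url: GATEWAY_URL,
    init: {
      method: 'POST',
      headers: {
        // The gateway key is sent as a bearer token; no xAI key is involved.
        Authorization: `Bearer ${apiKey}`,
        'Content-Type': 'application/json',
      },
      body: JSON.stringify({
        model: MODEL_ID,
        messages: [{ role: 'user', content: prompt }],
      }),
    },
  };
}
```

A caller would typically read the key from an environment variable and pass the result to `fetch(url, init)`. Keeping request construction separate from sending makes the headers easy to inspect without a network call.
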
Can Grok 3 Mini Fast Beta handle code generation?
It handles basic code generation, boilerplate, and simple transformations. For complex coding tasks, Grok 3 Fast or Grok Code Fast 1 are better suited.
Does Vercel AI Gateway support Zero Data Retention for Grok 3 Mini Fast Beta?
Zero Data Retention is not currently available for this model. ZDR on AI Gateway applies to direct gateway requests; BYOK flows aren't covered. See https://vercel.com/docs/ai-gateway/capabilities/zdr for details.
