Grok 3 Mini Fast Beta is the speed-optimized configuration of the Grok 3 Mini model, released February 17, 2025. It represents the most cost-efficient option in the Grok 3 lineup, combining the compact architecture of Grok 3 Mini with additional inference optimization for maximum tokens-per-second throughput.
With a context window of 131.1K tokens, the model handles standard language tasks including classification, extraction, summarization, and basic code generation. It's engineered for workloads where volume and speed dominate the requirements, rather than deep reasoning or analytical depth.
Grok 3 Mini Fast Beta is available at $0.6 per million input tokens and $4.0 per million output tokens through Vercel AI Gateway. At this price point, it enables use cases that would be cost-prohibitive with larger models, such as processing millions of records per day or powering high-traffic consumer applications.