Skip to content
Vercel April 2026 security incident

Grok 4.20 Beta Non-Reasoning

xai/grok-4.20-non-reasoning-beta

Grok 4.20 Beta Non-Reasoning is xAI's non-reasoning model in the Grok 4.20 beta generation, optimized for speed and direct responses with low hallucination rates and strict prompt adherence.

Tool UseImplicit CachingVision (Image)File Input
index.ts
import { streamText } from 'ai'
const result = streamText({
model: 'xai/grok-4.20-non-reasoning-beta',
prompt: 'Why is the sky blue?'
})

What To Consider When Choosing a Provider

  • Zero Data Retention

    AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.

    Authentication

    AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

Grok 4.20 Beta Non-Reasoning is in beta. Expect potential changes to behavior, pricing, or availability before general availability.

This variant produces direct answers. If you need the model to reason through complex problems step by step, use the Grok 4.20 Reasoning variant instead.

When to Use Grok 4.20 Beta Non-Reasoning

Best For

  • High-throughput production APIs:

    Direct, precise answers at low latency serve end users best

  • Agentic tool-calling workflows

    That benefit from fast decision-making with low hallucination rates

  • Classification and routing pipelines

    That need reliable, prompt-adherent output for downstream processing

  • Chat and conversational interfaces:

    Low-hallucination, prompt-adherent responses arrive quickly without chain-of-thought overhead

  • Content generation tasks

    Where strict prompt adherence matters more than deep reasoning

Consider Alternatives When

  • Complex analytical tasks

    Requiring multi-step reasoning. Use the Grok 4.20 Reasoning variant

  • Multi-agent orchestration:

    The Grok 4.20 Multi-Agent variant is purpose-built for agent collaboration

  • Stable production deployments:

    Beta models introduce unwanted risk. Use Grok 4.1 Fast Non-Reasoning instead

  • Maximum cost efficiency on simple tasks:

    Grok 3 Mini Fast offers lower per-token costs

Conclusion

Grok 4.20 Beta Non-Reasoning trades reasoning depth for speed. Use it in agentic pipelines where fast, direct responses matter more than extended deliberation. For production stability, prefer the non-beta grok-4.20-non-reasoning.

FAQ

The model produces direct answers without generating chain-of-thought reasoning traces. This reduces latency and output token cost compared to the reasoning variant.

2M tokens.

Yes. Grok 4.20 Beta Non-Reasoning is currently in beta. Behavior, pricing, or availability may change before general availability.

Current pricing is shown on this page. AI Gateway routes across providers, and rates may vary by provider.

Use your Vercel AI Gateway API key with xai/grok-4.20-non-reasoning-beta as the model identifier. No separate xAI account is needed for gateway-managed access.

Grok 4.20 Beta Non-Reasoning is the newer generation with improved hallucination rates and prompt adherence. Grok 4.1 Fast Non-Reasoning is a stable, generally available alternative if you prefer to avoid beta models.

Zero Data Retention is not currently available for this model. ZDR on AI Gateway applies to direct gateway requests; BYOK flows aren't covered. See https://vercel.com/docs/ai-gateway/capabilities/zdr for details.