Grok Imagine

Grok Imagine is xAI's video generation model. It creates video clips from text prompts and images with motion, generated audio, and lip-sync, and is available through Vercel AI Gateway. Your use is subject to xAI's Terms & Privacy Policies.

image-to-video, text-to-video, video-editing, audio-generation

Use with AI Gateway

index.ts

import { experimental_generateVideo as generateVideo } from 'ai';

const result = await generateVideo({
  model: 'xai/grok-imagine-video',
  prompt: 'A serene mountain lake at sunrise.'
});


Playground

Try out Grok Imagine by xAI. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

The playground form takes these inputs:

Images (optional): up to 5
Video to edit (optional)
Prompt (optional)
Duration: 8s
Resolution: 480p or 720p
Aspect ratio
Videos to generate
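The playground inputs can be checked before a request is sent. The following is a minimal sketch, assuming the limits visible in the playground form (at most 5 images, 480p or 720p) also apply to API requests, which this page does not confirm. The `PlaygroundRequest` shape, its field names, and the rule that at least one input must be present are hypothetical and exist only for illustration.

```typescript
// Limits read off the playground form; the API's real limits may differ.
const MAX_IMAGES = 5;
const RESOLUTIONS: readonly string[] = ['480p', '720p'];

// Hypothetical request shape, for illustration only.
interface PlaygroundRequest {
  prompt?: string;
  imageUrls?: string[];
  videoUrl?: string;
  resolution: string;
}

// Returns a list of human-readable problems.
// An empty list means the request passes these checks.
function validateRequest(req: PlaygroundRequest): string[] {
  const problems: string[] = [];
  const images = req.imageUrls ?? [];

  if (images.length > MAX_IMAGES) {
    problems.push(`at most ${MAX_IMAGES} images are allowed, got ${images.length}`);
  }

  if (!RESOLUTIONS.some((r) => r === req.resolution)) {
    problems.push(`resolution must be one of: ${RESOLUTIONS.join(', ')}`);
  }

  // Assumption: every input is marked optional, but a request with
  // no prompt, no images, and no video has nothing to generate from.
  const hasPrompt = (req.prompt ?? '').trim().length > 0;
  if (!hasPrompt && images.length === 0 && !req.videoUrl) {
    problems.push('provide at least one of: prompt, images, or a video to edit');
  }

  return problems;
}
```

Returning a list rather than throwing on the first problem lets a form or a CLI report every issue at once.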

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider	Input	Output	Capabilities	ZDR	No Training	Release Date
xAI (Legal: Terms, Privacy)	$0.002/img, $0.01/sec	$0.05/sec (+1 more)				01/28/2026
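The listed rates can be turned into a rough per-request estimate. This is a sketch only: it assumes the input rates apply per input image and per second of input video, and it uses only the $0.05/sec output rate shown in the table, so the additional output tier hidden behind "+1 more" is not modeled. The `Usage` type and `estimateCostUsd` function are hypothetical names.

```typescript
// Rates copied from the provider table, expressed in thousandths of a
// dollar (mills) so whole-number inputs avoid floating-point drift.
const INPUT_IMAGE_MILLS = 2; // $0.002 per input image
const INPUT_VIDEO_MILLS_PER_SEC = 10; // $0.01 per second of input video (assumed meaning)
const OUTPUT_MILLS_PER_SEC = 50; // $0.05 per second of output video (listed tier only)

// Hypothetical usage record, for illustration only.
interface Usage {
  inputImages: number;
  inputVideoSeconds: number;
  outputSeconds: number;
}

// Estimates the cost of one request in US dollars.
function estimateCostUsd(usage: Usage): number {
  const mills =
    usage.inputImages * INPUT_IMAGE_MILLS +
    usage.inputVideoSeconds * INPUT_VIDEO_MILLS_PER_SEC +
    usage.outputSeconds * OUTPUT_MILLS_PER_SEC;
  return mills / 1000;
}
```

Under these assumptions, an 8-second clip generated from one input image works out to 0.002 + 8 × 0.05 = 0.402 dollars.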

More models by xAI

Model	Context	Latency	Throughput	Input	Output	Cache	Web Search	Capabilities	Providers	ZDR	No Training	Release Date
xai/grok-4.5	500K	1.6s	69tps	$2/M	$6/M	Read: $0.3/M, Write: —	$5/K + input costs					07/08/2026
xai/grok-build-0.1	256K	0.4s	140tps	$1/M	$2/M	Read: $0.2/M, Write: —	$5/K + input costs					05/20/2026
xai/grok-4.3		1.0s	108tps	$1.25/M	$2.50/M	Read: $0.2/M, Write: —	$5/K + input costs					04/30/2026
xai/grok-4.20-non-reasoning		0.3s	126tps	$1.25/M	$2.50/M	Read: $0.2/M, Write: —	$5/K + input costs					03/10/2026
xai/grok-4.1-fast-non-reasoning		0.4s	170tps	$0.20/M	$0.50/M	Read: $0.05/M, Write: —	—					11/19/2025
xai/grok-4.1-fast-reasoning		1.0s	124tps	$0.20/M	$0.50/M	Read: $0.05/M, Write: —	—					11/19/2025

About Grok Imagine

Grok Imagine is xAI's video generation model, released January 28, 2026 and available through Vercel AI Gateway. It generates video clips with motion from text descriptions and static images, follows instructions in complex prompts, and accepts follow-up instructions to refine scenes.

The model supports three primary generation modes: text-to-video (creating clips from text descriptions), image-to-video (generating motion from static images), and video editing (modifying existing video content through style changes, object replacement, and scene alterations). It also generates audio timed to the video with lip-sync, so you can skip separate voice recording for many workflows. Grok Imagine produces short clips quickly enough for iterative creative workflows. You can call it from the AI SDK's generateVideo function, the AI Gateway playground at https://ai-sdk.dev/playground/xai:grok-imagine-video, or the v0 Grok Creative Studio. Video generation is currently available to Pro and Enterprise plan subscribers and paid AI Gateway users.
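The three generation modes differ mainly in which inputs accompany the prompt. The sketch below shows one way a caller might work out which mode a set of inputs implies. The `Inputs` type, the `inferMode` function, and the precedence (a video wins over images, images win over plain text) are assumptions made for illustration; this page does not describe how the gateway itself selects a mode.

```typescript
type Mode = 'text-to-video' | 'image-to-video' | 'video-editing';

// Hypothetical input bundle, for illustration only.
interface Inputs {
  prompt?: string;
  images?: string[];
  video?: string;
}

// Picks the generation mode implied by the inputs.
// Assumed precedence: existing video, then images, then text alone.
function inferMode(inputs: Inputs): Mode {
  if (inputs.video) {
    return 'video-editing';
  }
  if (inputs.images && inputs.images.length > 0) {
    return 'image-to-video';
  }
  const hasPrompt = (inputs.prompt ?? '').trim().length > 0;
  if (!hasPrompt) {
    throw new Error('text-to-video needs a non-empty prompt');
  }
  return 'text-to-video';
}
```

Making the mode explicit in application code is useful for logging and for cost tracking, since input images and input video are priced separately from output.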

What To Consider When Choosing a Provider

Availability: Video generation is currently limited to Pro and Enterprise plans and to paid AI Gateway users. Verify that your plan supports video generation before integrating.
Prompting: Grok Imagine understands follow-up instructions that tweak a scene. Refine output through iterative prompting rather than trying to get the perfect result in a single generation.
Zero Data Retention: AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.
Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
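Because requests are authenticated with either an API key or an OIDC token, application startup code often needs to decide which credential is present. The following is a minimal sketch. The environment variable names `AI_GATEWAY_API_KEY` and `VERCEL_OIDC_TOKEN` are assumptions here, since this page does not name them, and preferring the API key when both are set is an arbitrary choice. `resolveCredential` is a hypothetical helper, not part of any SDK.

```typescript
type Credential =
  | { kind: 'api-key'; token: string }
  | { kind: 'oidc'; token: string };

// Looks for a gateway credential in an environment-like object.
// Variable names are assumptions; check the AI Gateway docs for your setup.
function resolveCredential(env: Record<string, string | undefined>): Credential {
  const apiKey = env['AI_GATEWAY_API_KEY'];
  if (apiKey) {
    return { kind: 'api-key', token: apiKey };
  }

  const oidcToken = env['VERCEL_OIDC_TOKEN'];
  if (oidcToken) {
    return { kind: 'oidc', token: oidcToken };
  }

  throw new Error('no AI Gateway credential found: set an API key or an OIDC token');
}
```

In a Node.js process this would typically be called with `process.env`. Failing early with a clear message avoids discovering a missing credential only after a generation request has been rejected.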

When to Use Grok Imagine

Best for

Marketing and social media video content: Custom clips at scale without traditional video production
Product demos and explainer videos: Generated scenes that visualize concepts, features, or workflows
Creative prototyping and storyboarding: Iteration on visual concepts before committing to full production
Content creation pipelines: Short video assets generated programmatically for personalization or A/B testing
Lip-synced video with native audio: Talking-head content, presentations, or character-driven narratives
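For the content pipeline case, prompt variants for personalization or A/B testing can be produced from a single template before any video is generated. This is a sketch under stated assumptions: the `{slot}` placeholder syntax, the `Variant` type, and the `buildVariants` helper are all hypothetical and are not part of the AI SDK or AI Gateway.

```typescript
// Hypothetical record pairing a stable id with a filled-in prompt.
interface Variant {
  id: string;
  prompt: string;
}

// Fills every occurrence of `{slot}` in the template with each value in turn.
function buildVariants(template: string, slot: string, values: string[]): Variant[] {
  const placeholder = `{${slot}}`;
  return values.map((value, index) => ({
    id: `variant-${index + 1}`,
    prompt: template.split(placeholder).join(value),
  }));
}

const variants = buildVariants(
  'A {product} on a beach at sunset, slow camera orbit.',
  'product',
  ['sneaker', 'backpack'],
);
```

Each resulting `prompt` can then be passed as the `prompt` option of a `generateVideo` call, with the `id` kept alongside the output so results can be attributed to the right variant.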

Consider alternatives when

Static image generation: Grok Imagine Image or Grok Imagine Image Pro handles the task without video overhead
Long-form video production: Traditional editing tools provide more control over extended content
Free-tier usage: Video generation currently requires a paid plan

Conclusion

Grok Imagine brings AI video generation into the Vercel AI Gateway ecosystem. It supports text-to-video, image-to-video, video editing, and audio in one pipeline. Iterative prompting and short clip latency fit production workflows that need custom video without full traditional production.
