Skip to content

Grok Imagine

Grok Imagine is xAI's video generation model. It creates video clips from text prompts and images with motion, generated audio, and lip-sync, available through Vercel AI Gateway.

image-to-videotext-to-videovideo-editingaudio-generation
index.ts
import { experimental_generateVideo as generateVideo } from 'ai';
const result = await generateVideo({
model: 'xai/grok-imagine-video',
prompt: 'A serene mountain lake at sunrise.'
});

More models by xAI

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
1M
1.4s
70tps
$1.25/M
$2.50/M
Read:
$0.2/M
Write:
$5/K
+ input costs
xai logo
04/30/2026
2M
0.7s
50tps
$0.20/M
$0.50/M
Read:
$0.05/M
Write:
$5/K
+ input costs
xai logo
09/19/2025
256K
0.4s
87tps
$0.20/M$1.50/M
Read:$0.02/M
Write:
xai logo
08/28/2025
2M
0.2s
193tps
$0.20/M
$0.50/M
Read:
$0.05/M
Write:
$5/K
+ input costs
vertex logo
xai logo
07/09/2025
2M
0.7s
362tps
$0.20/M
$0.50/M
Read:
$0.05/M
Write:
$5/K
+ input costs
vertex logo
xai logo
07/09/2025
131K
0.3s
114tps
$0.30/M$0.50/M
Read:$0.07/M
Write:
xai logo
02/17/2025