Skip to content

Grok Imagine

Grok Imagine is xAI's video generation model. It creates video clips from text prompts and images with motion, generated audio, and lip-sync, available through Vercel AI Gateway.

image-to-videotext-to-videovideo-editingaudio-generation
index.ts
import { experimental_generateVideo as generateVideo } from 'ai';
const result = await generateVideo({
model: 'xai/grok-imagine-video',
prompt: 'A serene mountain lake at sunrise.'
});

More models by xAI

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date
256K
0.2s
199tps
$1.00/M
$2.00/M
Read:
$0.2/M
Write:
$5/K
+ input costs
xai logo
05/20/2026
1M
0.6s
174tps
$1.25/M
$2.50/M
Read:
$0.2/M
Write:
$5/K
+ input costs
xai logo
04/30/2026
2M
0.5s
223tps
$1.25/M
$2.50/M
Read:
$0.2/M
Write:
$5/K
+ input costs
vertex logo
xai logo
03/09/2026
2M
0.4s
115tps
$1.25/M
$2.50/M
Read:
$0.2/M
Write:
$5/K
+ input costs
vertex logo
xai logo
03/09/2026
1M
4.9s
190tps
$0.20/M$0.50/M
Read:$0.05/M
Write:
vertex logo
07/09/2025
1M
0.3s
72tps
$0.20/M$0.50/M
Read:$0.05/M
Write:
vertex logo
07/09/2025