Wan v2.6 Image-to-Video
Wan v2.6 Image-to-Video is Alibaba's image-to-video model that animates still images into high-fidelity video clips up to 1080p and 15 seconds, with optional audio and precise motion control from text guidance.
import { experimental_generateVideo as generateVideo } from 'ai';
const result = await generateVideo({ model: 'alibaba/wan-v2.6-i2v', prompt: 'A serene mountain lake at sunrise.'});Frequently Asked Questions
What image formats and sizes are accepted?
The model accepts images between 360px and 2000px on each dimension, with a maximum file size of 100MB.
Can I control the direction of motion with a text prompt?
Yes. A text prompt accompanying the image guides the generated motion, camera direction, and scene atmosphere.
Does Wan v2.6 Image-to-Video support audio output?
Audio integration is optionally available, ambient sounds can accompany the generated video when enabled.
How long can generated clips be?
Clips can be 5, 10, or 15 seconds long, making this the longest-output option in the I2V variants.
What aspect ratios are supported?
The model supports 16:9, 9:16, 1:1, 4:3, and 3:4, the widest aspect ratio selection in the Wan 2.6 I2V lineup.
When should I choose I2V Flash instead?
If you are prototyping or need fast feedback on motion ideas, the flash variant offers faster generation. For final-quality deliverables at 1080p, the standard I2V model is recommended.