Skip to content

Wan v2.6 Image-to-Video

Wan v2.6 Image-to-Video is Alibaba's image-to-video model that animates still images into high-fidelity video clips up to 1080p and 15 seconds, with optional audio and precise motion control from text guidance.

image-to-video
index.ts
import { experimental_generateVideo as generateVideo } from 'ai';
const result = await generateVideo({
model: 'alibaba/wan-v2.6-i2v',
prompt: 'A serene mountain lake at sunrise.'
});

Frequently Asked Questions

  • What image formats and sizes are accepted?

    The model accepts images between 360px and 2000px on each dimension, with a maximum file size of 100MB.

  • Can I control the direction of motion with a text prompt?

    Yes. A text prompt accompanying the image guides the generated motion, camera direction, and scene atmosphere.

  • Does Wan v2.6 Image-to-Video support audio output?

    Audio integration is optionally available, ambient sounds can accompany the generated video when enabled.

  • How long can generated clips be?

    Clips can be 5, 10, or 15 seconds long, making this the longest-output option in the I2V variants.

  • What aspect ratios are supported?

    The model supports 16:9, 9:16, 1:1, 4:3, and 3:4, the widest aspect ratio selection in the Wan 2.6 I2V lineup.

  • When should I choose I2V Flash instead?

    If you are prototyping or need fast feedback on motion ideas, the flash variant offers faster generation. For final-quality deliverables at 1080p, the standard I2V model is recommended.