Skip to content

Wan v2.6 Reference-to-Video Flash

Wan v2.6 Reference-to-Video Flash is Alibaba's fast reference-to-video model that preserves subject identity from video references and generates new scenes at speed, supporting 720p and 1080p output for rapid creative iteration.

reference-to-video
index.ts
import { experimental_generateVideo as generateVideo } from 'ai';
const result = await generateVideo({
model: 'alibaba/wan-v2.6-r2v-flash',
prompt: 'A serene mountain lake at sunrise.'
});

Frequently Asked Questions

  • What makes R2V Flash different from the standard R2V model?

    Flash is speed-optimized. It generates reference-consistent video much faster than the standard R2V at a potential tradeoff in peak identity fidelity. For drafts and iteration, Flash is preferred; for final output, the standard R2V model is recommended.

  • Does R2V Flash support the same reference video format as standard R2V?

    Yes. Both accept the same reference URL lists and prompt conventions: use character1, character2, and so on in the prompt, in URL order, with 2 to 30 seconds per video reference where applicable.

  • What resolutions does R2V Flash support?

    720p and 1080p. The R2V variants don't include a 480p option.

  • What is the maximum generated video length?

    Output duration is 2 to 10 seconds for Wan R2V on AI Gateway. The 15-second option available on some T2V and I2V models does not apply here.

  • Can R2V Flash handle multiple characters from different reference clips in one scene?

    Yes. You can combine several reference URLs in one request (within provider limits) and name them character1, character2, and so on in the prompt.

  • Is audio included in generated output?

    Voice and audio characteristics captured from the reference clips are part of the identity extraction process; check provider-level documentation for specific audio output behavior.