Skip to content

Wan v2.6 Text-to-Video

Wan v2.6 Text-to-Video is the production-grade text-to-video model in Alibaba Cloud's Wan series, generating cinematic clips up to 15 seconds with automatic multi-shot scene composition and native audio at resolutions up to 1080p.

text-to-video

index.ts

import { experimental_generateVideo as generateVideo } from 'ai';

const result = await generateVideo({
  model: 'alibaba/wan-v2.6-t2v',
  prompt: 'A serene mountain lake at sunrise.'
});

Overview About Providers Similar FAQ

More models by Alibaba Cloud

Model

Context	Latency	Throughput	Input	Output	Cache	Web Search	Capabilities	Providers	ZDR	No Training	Release Date

alibaba/qwen3.7-plus

1M

2.9s

356tps

$0.32/M

$1.28/M

Read:$0.08/M

Write:$0.5/M

—

+2

06/02/2026

alibaba/qwen3.7-max

991K

2.5s

55tps

$1.25/M

$3.75/M

Read:$0.25/M

Write:$1.56/M

—

05/21/2026

alibaba/qwen-3.6-max-preview

240K

1.9s

105tps

$1.30/M

$7.80/M

Read:

$0.26/M

Write:

$1.63/M

—

04/20/2026

alibaba/qwen3.6-plus

1M

1.2s

110tps

$0.50/M

$3/M

Read:

$0.1/M

Write:

$0.63/M

—

+1

04/02/2026

alibaba/qwen3.5-flash

1M

0.9s

235tps

$0.10/M

$0.40/M

Read:$0.0/M

Write:$0.13/M

—

+1

02/24/2026

alibaba/qwen3-embedding-4b

33K

$0.02/M

—

06/05/2025