Seedance v1.5 Pro
Seedance v1.5 Pro is ByteDance's first audio-visual joint generation video model, released December 16, 2025. It produces synchronized dialogue, sound effects, and ambient audio alongside 1080p video in one generation pass, with multilingual voice and regional dialect support.
import { experimental_generateVideo as generateVideo } from 'ai';
const result = await generateVideo({ model: 'bytedance/seedance-v1.5-pro', prompt: 'A serene mountain lake at sunrise.'});About Seedance v1.5 Pro
Seedance v1.5 Pro shifts the Seedance line from visual generation alone to joint audio-visual creation. Released December 16, 2025, it's the first Seedance model to generate voice, sound effects, and ambient audio synchronized to video in a single inference pass. You don't run a separate text-to-speech or audio compositing step.
The audio system supports multilingual speech generation across six languages: Chinese, English, Japanese, Korean, Spanish, and Indonesian. It also covers regional dialects such as Sichuanese and Cantonese. Vocal synthesis targets prosody and intonation that track the scene. Spatial reverb in sound effects matches the visual scene's physical context. ByteDance's release cites lip movement alignment, intonation patterning, and performance rhythm synchronization as focus areas versus listed baselines. See https://console.byteplus.com/ark/region:ark+ap-southeast-1/model/detail?Id=seedance-1-5-pro for tables and comparisons.
On the video side, Seedance v1.5 Pro raises the ceiling relative to Seedance 1.0 Pro. Where 1.0 focused on motion stability, 1.5 Pro extends camera control and finishing. You get cinematic camera controls including continuous long takes and dolly zooms, color grading controls, more facial detail in close-ups, and richer dynamic motion. Output supports 480p, 720p, and 1080p resolution at 24 fps, with clips from four to 12 seconds and seven aspect ratios.