Seedance v1.5 Pro shifts the Seedance line from visual generation alone to joint audio-visual creation. Released December 16, 2025, it's the first Seedance model to generate voice, sound effects, and ambient audio synchronized to video in a single inference pass. You don't run a separate text-to-speech or audio compositing step.
The audio system supports multilingual speech generation across six languages: Chinese, English, Japanese, Korean, Spanish, and Indonesian. It also covers regional dialects such as Sichuanese and Cantonese. Vocal synthesis targets prosody and intonation that track the scene. Spatial reverb in sound effects matches the visual scene's physical context. ByteDance's release cites lip movement alignment, intonation patterning, and performance rhythm synchronization as focus areas versus listed baselines. See https://console.byteplus.com/ark/region:ark+ap-southeast-1/model/detail?Id=seedance-1-5-pro for tables and comparisons.
On the video side, Seedance v1.5 Pro raises the ceiling relative to Seedance 1.0 Pro. Where 1.0 focused on motion stability, 1.5 Pro extends camera control and finishing. You get cinematic camera controls including continuous long takes and dolly zooms, color grading controls, more facial detail in close-ups, and richer dynamic motion. Output supports 480p, 720p, and 1080p resolution at 24 fps, with clips from four to 12 seconds and seven aspect ratios.