Kling v3.0 Text-to-Video

Kling v3.0 Text-to-Video is Kling's v3.0 text-to-video model with multi-shot narrative generation, physics-aware motion, native multilingual audio, and up to 15-second output from a single prompt.

text-to-videomulti-shotaudio-generation

import { experimental_generateVideo as generateVideo } from 'ai';

const result = await generateVideo({
  model: 'klingai/kling-v3.0-t2v',
  prompt: 'A serene mountain lake at sunrise.'
});

Overview Playground About Providers Throughput Latency Similar FAQ

Latency24 hours

P50 time to first token (TTFT) on live AI Gateway traffic, in milliseconds. See the docs for more information.

AI Cloud

Core Platform

Security

Company

Learn

Open Source

Use Cases

Tools

Users

Kling v3.0 Text-to-Video