Build upon an All-in-One product framework, the Kling 3.0 model series supports full multimodal input and output spanning text, images, audio, and video, bringing the understanding, generation, and editing of video together in one streamlined AI workflow. The models integrate multiple tasks, including text-to-video, image-to-video, reference-to-video, and in-video editing, into a single, native multimodal architecture, enabling the models to follow complex narrative logic, deliver precise shot control, and maintain strong prompt adherence.
import { experimental_generateVideo as generateVideo } from 'ai';
const result = await generateVideo({ model: 'klingai/kling-v3.0-i2v', prompt: 'A serene mountain lake at sunrise.'});Try out Kling v3.0 Image-to-Video by Kling AI. Usage is billed to your team at API rates. Free users get $5 of credits every 30 days, and you are considered a free user if you haven't made a payment.
Chat with
Powered by AI Gateway
| Model |
|---|
Context | Image Gen | Video Gen | Max Output | Latency | Throughput | Input | Output | Cache | Web Search | Capabilities | Providers |
|---|