Kling v2.6 Motion Control
Kling v2.6 Motion Control transfers full-body motion from a 3-30 second reference clip to a generated scene, capturing gestures, facial expressions, lip-sync, and camera movement with frame-accurate fidelity.
import { experimental_generateVideo as generateVideo } from 'ai';

const result = await generateVideo({
  model: 'klingai/kling-v2.6-motion-control',
  prompt: 'A serene mountain lake at sunrise.',
});

What To Consider When Choosing a Provider
Zero Data Retention
AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.

Authentication
AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.
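If you prefer to configure the key explicitly rather than relying on the environment, a minimal sketch looks like the following. It assumes the @ai-sdk/gateway provider package; by default the key is read from the AI_GATEWAY_API_KEY environment variable, or from a Vercel OIDC token on deployment.

import { experimental_generateVideo as generateVideo } from 'ai';
import { createGateway } from '@ai-sdk/gateway';

// Explicit key configuration; omit this and set AI_GATEWAY_API_KEY
// in the environment to get the same behavior.
const gateway = createGateway({
  apiKey: process.env.AI_GATEWAY_API_KEY,
});

const result = await generateVideo({
  // Assumes the provider instance resolves this video model id the same
  // way the plain model string does in the quickstart above.
  model: gateway('klingai/kling-v2.6-motion-control'),
  prompt: 'A serene mountain lake at sunrise.',
});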
Reference video quality drives transfer accuracy. Use a clearly visible subject, stable framing, and well-lit footage.
When to Use Kling v2.6 Motion Control
Best For
Dance and performance content: a reference performance transfers to a generated character or setting.
Talking-head dialogue: videos that need accurate facial expression and lip-sync replication.
Camera movement transfer: camera behavior from a reference shot carries into the new scene.
Fashion and product video: production using human movement references in controlled studio footage.
Consider Alternatives When
No reference video: you don't have a reference motion clip and prefer purely text-driven generation.
Simple image animation: you want to animate a still image into a short video without motion transfer; see the i2v variants instead.
Multi-shot narratives: you need narrative generation with independent scene segments; see v3.0 instead.
Conclusion
Kling v2.6 Motion Control transfers a precise movement sequence to a generated scene without manual animation or frame-by-frame keyframing. For character performance, staged shots, dance, or action, it carries hand, facial, and camera motion from the reference as a single performance.
FAQ
How long should the reference video be?
The reference video should be 3-30 seconds long. Clearer subject visibility and stable framing produce more accurate motion transfer in the output.

Does camera movement transfer from the reference?
Yes. Camera behavior (pan, push, pull, and rotation) in the reference clip replicates in the generated video, not just subject body motion.

How does the model handle fast or intricate motion?
The model reduces artifacts on fast, intricate motions. Hand articulation and high-speed body movements render with improved fidelity compared to earlier motion transfer approaches.

How long can the generated video be?
Outputs can reach up to 30 seconds. This eliminates the need to stitch multiple short clips together for longer sequences.

Do facial expressions and lip-sync transfer?
Yes. Facial expression tracking and lip-sync alignment transfer from the reference and apply to the generated subject.

Is a text prompt required?
A text prompt is optional. Use it to describe the desired scene, subject, and styling for the output. Motion derives from the reference clip while the prompt defines what appears in the new video.
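Since motion comes from the reference clip while the prompt defines the scene, a combined call might look like the sketch below. The klingai providerOptions key and the referenceVideoUrl parameter name are hypothetical placeholders, not confirmed option names; check the model's parameter reference for the exact field.

import { experimental_generateVideo as generateVideo } from 'ai';

const result = await generateVideo({
  model: 'klingai/kling-v2.6-motion-control',
  // The prompt describes what appears in the new video...
  prompt: 'A bronze robot dancing on a neon-lit rooftop at night.',
  // ...while the reference clip supplies the motion to transfer.
  providerOptions: {
    klingai: {
      // Hypothetical parameter name for the 3-30 second reference clip.
      referenceVideoUrl: 'https://example.com/reference-dance.mp4',
    },
  },
});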