Skip to content

Kling v2.6 Motion Control

Kling v2.6 Motion Control transfers full-body motion from a 3-30 second reference clip to a generated scene, capturing gestures, facial expressions, lip-sync, and camera movement with frame-accurate fidelity.

Video Gen
index.ts
import { experimental_generateVideo as generateVideo } from 'ai';
const result = await generateVideo({
model: 'klingai/kling-v2.6-motion-control',
prompt: 'A serene mountain lake at sunrise.'
});

Playground

Try out Kling v2.6 Motion Control by Kling AI. Usage is billed to your team at API rates. Free users (those who haven't made a payment) get $5 of credits every 30 days.

Providers

Route requests across multiple providers. Copy a provider slug to set your preference. Visit the docs for more info. Using a provider means you agree to their terms, listed under Legal.

Provider
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
ZDR
No Training
Release Date
Kling AI
Legal:Terms
Privacy
12/21/2025

More models by Kling AI

Model
Context
Latency
Throughput
Input
Output
Cache
Web Search
Per Query
Capabilities
Providers
ZDR
No Training
Release Date

About Kling v2.6 Motion Control

Kling v2.6 Motion Control is a video-to-video generation model built around Total Motion Transfer: replicating a motion sequence from a reference clip in an entirely new generated scene. You provide a 3-30 second reference video containing the movement to capture. The model applies that motion to a new subject and setting defined by a text or image prompt.

The system handles fast, intricate actions with high frame-level fidelity. Martial arts sequences, dance routines, and other high-speed movements that challenge basic motion estimation render with reduced artifacts in hand regions. Hand articulation has historically been a weak point in motion transfer systems. Facial expression tracking and lip-sync alignment carry over from the reference, making the model suitable for character animation and talking-head video production.

Kling v2.6 Motion Control also extracts camera behavior from the reference clip. Panning, push-in, pull-out, and rotation moves replicate in the generated output. A reference shot with deliberate camera motion carries that staging into the new scene, not only the actor motion. Output duration extends up to 30 seconds without manual clip stitching.

What To Consider When Choosing a Provider

  • Configuration: Reference video quality drives transfer accuracy. Use clear subjects, stable framing, and well-lit motion.
  • Zero Data Retention: AI Gateway does not currently support Zero Data Retention for this model. See the documentation for models that support ZDR.
  • Authentication: AI Gateway authenticates requests using an API key or OIDC token. You do not need to manage provider credentials directly.

When to Use Kling v2.6 Motion Control

Best For

  • Dance and performance content: A reference performance transfers to a generated character or setting
  • Talking-head dialogue: Videos that need accurate facial expression and lip-sync replication
  • Camera movement transfer: Camera behavior from a reference shot carries into the new scene
  • Fashion and product video: Production using human movement references in controlled studio footage

Consider Alternatives When

  • No reference video: You don't have a reference motion video and prefer purely text-driven generation
  • Simple image animation: You want to animate an image into short video without motion transfer, so see i2v variants
  • Multi-shot narratives: You need narrative generation with independent scene segments, so see v3.0

Conclusion

Kling v2.6 Motion Control transfers a precise movement sequence to a generated scene without manual animation or frame-by-frame keyframing. For character performance, staged shots, dance, or action, it moves hands, face, and camera together from the reference.

Frequently Asked Questions

  • What format and duration should the reference video be?

    The reference video should be 3-30 seconds long. Clearer subject visibility and stable framing produce more accurate motion transfer in the output.

  • Does Kling v2.6 Motion Control also transfer camera movement from the reference video?

    Yes. Camera behavior (pan, push, pull, and rotation) in the reference clip replicates in the generated video, not just subject body motion.

  • How does it handle fast or complex movements like martial arts or dance?

    The model reduces artifacts on fast, intricate motions. Hand articulation and high-speed body movements render with improved fidelity compared to earlier motion transfer approaches.

  • What is the maximum output duration?

    Outputs can reach up to 30 seconds. This eliminates the need to stitch multiple short clips together for longer sequences.

  • Can the model transfer facial expressions and lip-sync from the reference video?

    Yes. Facial expression tracking and lip-sync alignment transfer from the reference and apply to the generated subject.

  • Is a text prompt required alongside the reference video?

    A text prompt is optional. Use it to describe the desired scene, subject, and styling for the output. Motion derives from the reference clip while the prompt defines what appears in the new video.