kling-video
Generate AI videos with Kuaishou Kling via AceDataCloud API. Use when creating videos from text or images, extending existing videos, or applying motion control. Supports text-to-video, image-to-video, extend, and motion generation with multiple models and quality modes.
What it does
Kling Video Generation
Generate AI videos through AceDataCloud's Kuaishou Kling API.
Setup: See authentication for token setup.
Quick Start
curl -X POST https://api.acedata.cloud/kling/videos \
  -H "Authorization: Bearer $ACEDATACLOUD_API_TOKEN" \
  -H "Content-Type: application/json" \
  -d '{"action": "text2video", "prompt": "a cat playing piano on a rooftop at sunset", "model": "kling-v3", "mode": "std", "duration": 5}'
Async: See async task polling. Poll via `POST /kling/tasks` with `{"task_id": "..."}`.
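The polling step can be sketched as follows. This is a minimal sketch, not the official client: the endpoint and request body come from this document, but the response shape is not described here, so the `status` field name and the `"failed"` value are assumptions to check against the real API response. The `"succeed"` value is the one noted under Gotchas. `fetch_task` and `poll_task` are hypothetical helper names.

```python
import json
import time
import urllib.request
from typing import Callable

API_BASE = "https://api.acedata.cloud"


def fetch_task(task_id: str, token: str) -> dict:
    """POST /kling/tasks with {"task_id": ...} and return the decoded JSON body."""
    request = urllib.request.Request(
        f"{API_BASE}/kling/tasks",
        data=json.dumps({"task_id": task_id}).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def poll_task(
    fetch: Callable[[str], dict],
    task_id: str,
    interval_s: float = 5.0,
    max_attempts: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Call fetch(task_id) until the task succeeds, fails, or attempts run out."""
    for _ in range(max_attempts):
        task = fetch(task_id)
        # ASSUMPTION: the task state is exposed under a top-level "status" key.
        status = task.get("status")
        if status == "succeed":  # the success value is "succeed", not "succeeded"
            return task
        if status == "failed":  # ASSUMPTION: hypothetical failure value
            raise RuntimeError(f"task {task_id} failed: {task}")
        sleep(interval_s)
    raise TimeoutError(f"task {task_id} not finished after {max_attempts} polls")
```

A typical call would be `poll_task(lambda tid: fetch_task(tid, token), task_id)`. Passing the fetch function in keeps the loop testable without network access.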
Models
| Model | Quality | Best For |
|---|---|---|
| `kling-v3` | Latest | Best quality, flexible 3–15s duration, optional audio generation |
| `kling-v3-omni` | Latest | Omni model with audio support, flexible 3–15s duration |
| `kling-v2-6` | High | High-quality output with optional audio (pro mode) |
| `kling-v2-5-turbo` | High + Fast | Best speed/quality trade-off |
| `kling-v2-master` | High | High-quality output |
| `kling-v2-1-master` | High | Improved v2 |
| `kling-v1-6` | Improved | Better quality than v1 |
| `kling-v1` | Standard | Basic generation, lowest cost |
| `kling-video-o1` | Premium | Highest quality (thinking model) |
Quality Modes
| Mode | Speed | Cost | Use For |
|---|---|---|---|
| `std` (Standard) | Slower | Lower | Draft/preview |
| `pro` (Professional) | Faster | Higher | Final output |
| `4k` (Native 4K) | - | Premium | Native 4K output; only `kling-v3` and `kling-v3-omni`; incompatible with `camera_control` |
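The model and mode constraints stated in this document (4K availability, duration ranges, audio support) can be checked before a request is sent. The sketch below is a hypothetical client-side helper, not part of the API. It assumes `model` and `mode` are set explicitly in the payload, because server-side defaults are not described here.

```python
from typing import List

# Models that accept flexible 3-15 second durations and the 4k mode.
V3_MODELS = {"kling-v3", "kling-v3-omni"}


def validate_payload(payload: dict) -> List[str]:
    """Return a list of constraint violations; an empty list means none found."""
    problems = []
    model = payload.get("model")
    mode = payload.get("mode")
    duration = payload.get("duration")

    if mode == "4k":
        if model not in V3_MODELS:
            problems.append("mode=4k requires kling-v3 or kling-v3-omni")
        if "camera_control" in payload:
            problems.append("mode=4k is incompatible with camera_control")

    if duration is not None:
        if model in V3_MODELS:
            if not 3 <= duration <= 15:
                problems.append("duration must be within 3-15 seconds for v3 models")
        elif duration not in (5, 10):
            problems.append("duration must be 5 or 10 seconds for this model")

    if payload.get("generate_audio"):
        audio_ok = model in V3_MODELS or (model == "kling-v2-6" and mode == "pro")
        if not audio_ok:
            problems.append("generate_audio needs kling-v3, kling-v3-omni, or kling-v2-6 in pro mode")

    if "end_image_url" in payload and payload.get("action") != "image2video":
        problems.append("end_image_url only applies to the image2video action")

    return problems
```

Returning a list instead of raising on the first problem lets a caller report every issue in one pass.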
Workflows
1. Text-to-Video
POST /kling/videos
{
  "action": "text2video",
  "prompt": "a futuristic city with flying cars",
  "model": "kling-v3",
  "mode": "std",
  "duration": 5,
  "aspect_ratio": "16:9"
}
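For callers who prefer Python to curl, the same request might be assembled with the standard library as below. This is a sketch under stated assumptions: the base URL, path, and headers are taken from the Quick Start, while `build_video_request` and `send` are hypothetical helper names and the response body format is not specified in this document.

```python
import json
import urllib.request

API_BASE = "https://api.acedata.cloud"


def build_video_request(payload: dict, token: str) -> urllib.request.Request:
    """Build (but do not send) a POST /kling/videos request."""
    return urllib.request.Request(
        f"{API_BASE}/kling/videos",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)


text2video_payload = {
    "action": "text2video",
    "prompt": "a futuristic city with flying cars",
    "model": "kling-v3",
    "mode": "std",
    "duration": 5,
    "aspect_ratio": "16:9",
}
```

Calling `send(build_video_request(text2video_payload, os.environ["ACEDATACLOUD_API_TOKEN"]))` (after `import os`) would submit the job. Separating building from sending makes the request easy to inspect or log first.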
2. Image-to-Video
Animate a still image. Optionally specify an ending frame.
POST /kling/videos
{
  "action": "image2video",
  "prompt": "the scene slowly comes alive with movement",
  "start_image_url": "https://example.com/scene.jpg",
  "end_image_url": "https://example.com/end-scene.jpg",
  "model": "kling-v3",
  "mode": "pro"
}
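Because the ending frame is optional, a payload builder can simply omit `end_image_url` when none is given. The helper below is a hypothetical illustration. Its defaults mirror the example above and are not API defaults.

```python
from typing import Optional


def image2video_payload(
    prompt: str,
    start_image_url: str,
    end_image_url: Optional[str] = None,
    model: str = "kling-v3",
    mode: str = "pro",
) -> dict:
    """Build an image2video request body, adding the end frame only if provided."""
    payload = {
        "action": "image2video",
        "prompt": prompt,
        "start_image_url": start_image_url,
        "model": model,
        "mode": mode,
    }
    if end_image_url is not None:
        # The end frame defines the last frame of the generated video.
        payload["end_image_url"] = end_image_url
    return payload
```

For example, `image2video_payload("the scene slowly comes alive with movement", "https://example.com/scene.jpg")` produces a body with no `end_image_url` key.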
3. Extend Video
Continue an existing video with additional seconds.
POST /kling/videos
{
  "action": "extend",
  "video_id": "existing-video-id",
  "prompt": "the camera pulls back to reveal the full landscape",
  "model": "kling-v2-5-turbo"
}
4. Motion Control
Apply precise camera/motion control from an image + reference video.
POST /kling/motion
{
  "image_url": "https://example.com/subject.jpg",
  "video_url": "https://example.com/motion-reference.mp4"
}
Parameters
| Parameter | Values | Description |
|---|---|---|
| `action` | `"text2video"`, `"image2video"`, `"extend"` | Generation mode |
| `model` | See models table | Model to use |
| `mode` | `"std"`, `"pro"`, `"4k"` | Quality mode (`4k` only for `kling-v3` / `kling-v3-omni`; incompatible with `camera_control`) |
| `duration` | `5`, `10` (v3/v3-omni: `3`–`15`) | Duration in seconds |
| `generate_audio` | `true`, `false` | Generate audio with video (v3, v3-omni, v2-6 pro only) |
| `aspect_ratio` | `"16:9"`, `"9:16"`, `"1:1"` | Video aspect ratio |
| `cfg_scale` | `0`–`1` | Prompt relevance strength |
| `negative_prompt` | string | What to avoid in the video |
| `camera_control` | object | Camera movement parameters |
| `element_list` | array | Reference subjects from the element library (each item has `element_id`). Combined with `video_list`, total reference images + subjects ≤ 7 (or ≤ 4 if a reference video is included) |
| `video_list` | array | Reference video(s) via `video_url` (MP4/MOV, 3–10s, ≤200MB, max 1 video). Each item has `video_url`, `refer_type` (`"feature"` or `"base"`), and optional `keep_original_sound` |
| `callback_url` | string | Async callback URL |
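The reference limits for `element_list` and `video_list` are easy to get wrong, so a small pre-flight check may help. The sketch below is a hypothetical helper. This document does not say which parameter carries reference images, so their count is passed in as `reference_image_count` (an assumption), and a missing `refer_type` is treated as an error based on the table's wording.

```python
from typing import List


def check_reference_limits(
    element_list: List[dict],
    video_list: List[dict],
    reference_image_count: int = 0,
) -> List[str]:
    """Return violations of the documented reference limits (empty if none)."""
    problems = []

    if len(video_list) > 1:
        problems.append("video_list accepts at most 1 video")

    for index, video in enumerate(video_list):
        if "video_url" not in video:
            problems.append(f"video_list[{index}] is missing video_url")
        # ASSUMPTION: refer_type is required, since each item is said to have it.
        if video.get("refer_type") not in ("feature", "base"):
            problems.append(f"video_list[{index}].refer_type must be 'feature' or 'base'")

    for index, element in enumerate(element_list):
        if "element_id" not in element:
            problems.append(f"element_list[{index}] is missing element_id")

    # Images + subjects: at most 7, or at most 4 when a reference video is included.
    limit = 4 if video_list else 7
    total = reference_image_count + len(element_list)
    if total > limit:
        problems.append(f"{total} reference images + subjects exceeds the limit of {limit}")

    return problems
```

File-level constraints on the reference video (MP4/MOV, 3–10s, ≤200MB) are not checked here, since that would require inspecting the file itself.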
Gotchas
- `duration` supports `5` or `10` seconds for most models; `kling-v3` and `kling-v3-omni` support flexible `3`–`15` seconds
- `mode=4k` is only available for `kling-v3` and `kling-v3-omni` and is incompatible with `camera_control`
- `generate_audio` enables synchronized audio generation (supported by `kling-v3`, `kling-v3-omni`, and `kling-v2-6` in pro mode)
- `end_image_url` is only for the `image2video` action; it defines the last frame
- Motion control (`/kling/motion`) is a separate endpoint from video generation
- `pro` mode costs roughly 2x `std` mode but generates faster with better quality
- Task states use `"succeed"` (not "succeeded"); check for this value when polling
- `negative_prompt` helps avoid unwanted elements (e.g., "blurry, low quality, text")