Kling 2.6
Kling 2.6 Video Generation
- Async processing mode, returns task ID for subsequent queries
- Supports text-to-video, image-to-video (first frame/first-last frame control)
- Supports standard mode (720P) and professional mode (1080P)
- Professional mode supports automatic audio generation and voice selection
POST
Documentation Index
Fetch the complete documentation index at: https://gccai.heqingsong.uk/llms.txt
Use this file to discover all available pages before exploring further.
Authorization
All API endpoints require Bearer Token authenticationGet your API Key:Visit the API Key Management Page to get your API KeyAdd it to the request header:
Request Parameters
Video generation model nameSupported models:
kling-v2-6- Kling v2.6 (recommended)
Text prompt, maximum 2500 charactersDescribe scenes, actions, styles in detail for better generation resultsExample:
"A golden cat running on a sunlit meadow, slow motion, cinematic quality"Generation modeOptions:
std- Standard mode (720P, silent video only)pro- Professional mode (1080P, supports automatic audio generation)
stdVideo duration (seconds)Options:
5 or 10Default: 5Video aspect ratioOptions:
16:9- Landscape9:16- Portrait1:1- Square
16:9Negative prompt to exclude unwanted contentExample:
"blurry, low quality, distorted"Image URL array for image-to-video generation
- Pass 1 image: used as first frame
- Pass 2 images: automatically assigned as first frame + last frame (requires
mode: "pro")
["https://example.com/first.jpg"]Whether to automatically generate audioDefault:
falseWhether to add watermark
Feature Support Matrix
| Type | Feature | std 5s | std 10s | pro 5s | pro 10s |
|---|---|---|---|---|---|
| Text-to-Video | Generation | ✅ (silent only) | ✅ (silent only) | ✅ | ✅ |
| Text-to-Video | Auto Audio | - | - | ✅ | ✅ |
| Image-to-Video | Generation | ✅ (silent only) | ✅ (silent only) | ✅ | ✅ |
| Image-to-Video | First Frame | ✅ | ✅ | ✅ | ✅ |
| Image-to-Video | Last Frame | - | - | ✅ | ✅ |
| Image-to-Video | Auto Audio | - | - | ✅ | ✅ |
Note: In pro mode, last frame and audio control are mutually exclusive and cannot be used together.
Text-to-Video vs Image-to-Video
The system automatically determines the mode based on whetherimage_urls is provided: no images means text-to-video, with images means image-to-video.
| Parameter | Text-to-Video | Image-to-Video |
|---|---|---|
prompt | ✅ Required | ✅ Required |
image_urls | ❌ Not used | ✅ Required (1-2 images, last frame needs pro) |
negative_prompt | ✅ Optional | ✅ Optional |
mode | ✅ Optional | ✅ Optional |
duration | ✅ Optional | ✅ Optional |
aspect_ratio | ✅ Optional | ⚠️ May be overridden by image ratio |
audio | ✅ Optional (needs pro) | ✅ Optional (needs pro) |
watermark | ✅ Optional | ✅ Optional |
Response
Response status code, 200 on success
Response data array
Use Cases
Case 1: Text-to-Video (Standard Mode)
Case 2: Text-to-Video (Pro Mode + Negative Prompt)
Case 3: Image-to-Video (First Frame)
Case 4: Image-to-Video (First + Last Frame Control)
Case 5: Pro Mode + Auto Audio
Query Task ResultsVideo generation is an async task that returns a
task_id upon submission. Use the Get Task Status endpoint to query generation progress and results.