Wan2.7
Wan2.7-R2V Reference-to-Video
- Alibaba Cloud Wanxiang 2.7 reference-to-video model
- Generate a new video with consistent style, characters, and scenes based on one or more reference images/videos
- Supports character consistency, style transfer, and multi-asset combination
- Supports reference voice (reference_voice) to control character voice
POST
Documentation Index
Fetch the complete documentation index at: https://gccai.heqingsong.uk/llms.txt
Use this file to discover all available pages before exploring further.
Authorization
All API endpoints require Bearer Token authenticationGet your API Key:Visit the API Key Management Page to get your API KeyAdd it to the request header:
Request Parameters
Video generation model name, fixed as
wan2.7-r2vVideo content description, up to 5000 charactersFor multiple images/videos, use “image 1”, “image 2”, “video 1”, etc. to reference the corresponding reference assets (in order of input)Example:
"The character in image 1 enters the scene in image 2 and looks around"Image array with roles; at least one of this and
video_urls must be providedFields for each object:url(string): image URLrole(string): image rolereference_image- reference image (default)first_frame- specified first frame (when provided,sizeis ignored and the aspect ratio follows the first-frame image)
reference_voice(string, optional): voice sample audio URL for that reference character, used to control the character’s voice in the generated video
Reference video URL array, up to 5 videos (total images + videos ≤ 5)At least one of this and
image_with_roles must be providedVideo constraints:
- Format: mp4, mov
- Duration: 1–30s
- Resolution: width and height in the range [240, 4096] pixels
- Aspect ratio: 1:8 – 8:1
- File size: up to 100MB
Negative prompt describing unwanted content, up to 500 characters
Video resolutionOptions:
720P- Standard1080P- High definition (default)
Video duration (seconds)Supported range:
2 ~ 15 secondsDefault: 5Aspect ratioSupported formats:
16:9- Landscape widescreen (default)9:16- Portrait1:1- Square4:3- Landscape3:4- Portrait
Whether to enable intelligent prompt rewritingSignificantly improves results for short prompts, but increases processing timeDefault:
trueWhether to add “AI Generated” watermark to the generated video
true: add watermarkfalse: no watermark (default)
Seed integer used to control the randomness of generated contentValue range: integer
≥0- For identical requests, the model generates different results when receiving different seed values (e.g., omitting seed)
- For identical requests, the model generates similar results when receiving the same seed value, but exact consistency is not guaranteed
Response
Response status code, 200 on success
Response data array
Use Cases
Case 1: Single Reference Image (Simplest)
Case 2: Multiple Reference Images
Case 3: Generation Based on Reference Video
Case 4: Specified First Frame + Reference Image
Case 5: Reference Image + Reference Voice (Precise)
Image Reference Rules
With multiple reference images, use numeric indices in theprompt to refer to them:
- 1st image → “image 1” or “the first image”
- 1st video → “video 1” or “the first video”
Query Task ResultsVideo generation is an async task that returns a
task_id upon submission. Use the Get Task Status endpoint to query generation progress and results.