HappyHorse 1.0
HappyHorse 1.0 Video Generation
- Alibaba Cloud Bailian HappyHorse 1.0 video generation model (unified entry, single-model auto-routing)
- Auto-routes by parameters: T2V (prompt only) / I2V (first_frame_image) / R2V (image_urls) / EDIT (video_url)
- Supports 720P/1080P resolutions and any integer duration from 3 to 15 seconds
- Billed by resolution × duration (seconds) only, regardless of capability
POST
Documentation Index
Fetch the complete documentation index at: https://gccai.heqingsong.uk/llms.txt
Use this file to discover all available pages before exploring further.
Authorization
All API endpoints require Bearer Token authenticationGet your API Key:Visit the API Key Management Page to get your API KeyAdd it to the request header:
Mode Routing
happyhorse-1.0 is the unified entry for Text-to-Video / Image-to-Video / Reference-Image-to-Video / Video Edit. The backend automatically determines the mode based on incoming parameters. All modes are billed by the same rule (resolution × seconds only):
| Fields you pass | Routes To | Mode Description |
|---|---|---|
prompt only | Text-to-Video (T2V) | Generate video purely from text |
prompt + first_frame_image | Image-to-Video (I2V) | Animate from a first-frame image |
prompt + image_urls (1–9 images) | Reference-Image-to-Video (R2V) | Generate a new scene from reference images |
prompt + video_url (optional image_urls 0–5 as style refs / audio_setting) | Video Edit (EDIT) | Rewrite / restylize a source video |
video_url > first_frame_image > image_urls > prompt only.
Mutual exclusion rules: the three media fields (first_frame_image / image_urls / video_url) are mutually exclusive in pairs. The only valid combination is video_url + image_urls (EDIT mode + reference images). Passing two mutually exclusive fields returns 400 mixed_media_not_allowed.
Request Parameters
Video generation model name, fixed as
happyhorse-1.0Video content description, up to 2500 characters; cannot contain special tokens
- T2V / R2V / EDIT modes: required
- I2V mode: optional, but recommended to guide camera movement and actions
"A little girl walking down the road, cinematic feel"First-frame image, triggers I2V (Image-to-Video). Supports URL or base64 (
data:image/<mime>;base64,<payload>, the gateway uploads it to OSS automatically)Mutually exclusive with image_urls / video_urlFirst-frame image requirements:
- Format: JPEG / JPG / PNG / BMP / WEBP
- Short side: ≥ 300px
- Aspect ratio:
1:2.5to2.5:1 - File size: ≤ 10MB
Image array:
- R2V mode (only
image_urlsprovided): 1–9 images, used as subject/style references to generate a new scene - EDIT mode (provided together with
video_url): 0–5 images, used as style reference
first_frame_image; can be combined with video_urlReference image requirements:
- Format: JPEG / JPG / PNG / BMP / WEBP
- Short side: ≥ 720p recommended
- Aspect ratio: short / long ≥ 0.4
- File size: ≤ 10MB
- Count: R2V must be 1–9; EDIT up to 5
Source video URL, triggers EDIT (Video Edit). Base64 is not supported — provide an HTTP/HTTPS direct linkMutually exclusive with
first_frame_image; can be combined with image_urls (≤ 5)Source video requirements:
- Duration: 3–60 seconds (> 15s will be auto-truncated by the upstream from 0 to 15s)
- Resolution: minimum 480p, short side ≥ 360
- Aspect ratio:
1:8to8:1 - Format: MP4 / MOV (H.264 recommended)
- Frame rate: > 8 fps
- File size: ≤ 100MB
Audio setting, only effective in EDIT mode (must pass
video_url)Options:auto- Auto-generate audio (default)origin- Keep the source video’s audio track
Video resolution (affects billing)Options:
720P- Standard1080P- High definition (default)
Video duration in seconds (affects billing)Supported range: any integer from
3 to 15Default: 5Aspect ratioSupported formats:
16:9- Landscape widescreen (default)9:16- Portrait1:1- Square4:3- Landscape3:4- Portrait
Whether to add a watermark to the generated video
true: Add watermarkfalse: Do not add watermark (default)
Random seed used to control the randomness of generated contentValue range:
[0, 2147483647]. If omitted, a random seed is used.- For identical requests, the model generates different results when receiving different seed values (e.g., omitting seed)
- For identical requests, the model generates similar results when receiving the same seed value, but exact consistency is not guaranteed
Response
Response status code, 200 on success
Response data array
Use Cases
Case 1: Text-to-Video T2V (Simplest Request)
Case 2: Text-to-Video T2V (Full Parameters)
Case 3: Image-to-Video I2V (first_frame_image)
Case 4: Reference-Image-to-Video R2V (multiple references)
Case 5: Video Edit EDIT (keep original audio + style reference)
Case 6: 720P to Save Cost
Mode Selection Guide
| Requirement | Recommended Approach |
|---|---|
| Generate video from text only | Pass only prompt (T2V) |
| Make an image “come alive” (use it as the first frame) | Pass first_frame_image (I2V) |
| Generate a new scene from a set of reference images | Pass image_urls (1–9, R2V) |
| Rewrite / restylize an existing video | Pass video_url (EDIT), optionally combine with image_urls (0–5) as style refs |
| Save cost | Use resolution: "720P" |
Usage Tips
- Unified entry logic: input fields decide the mode. Note that the three media fields (
first_frame_image/image_urls/video_url) are mutually exclusive in pairs sizeonly effective in T2V/R2V: in I2V / EDIT modessizeis ignored — the output aspect ratio is determined by the input media- Duration: 5–10 seconds is the sweet spot. Too short causes choppy motion; too long significantly increases upstream processing time
- First-frame image quality: clear, well-composed, subject centered — significantly improves I2V output
- Prompt writing: describe motion / camera / atmosphere (e.g., “slow push-in, cinematic, warm tones”) for better results than purely static scene descriptions
- EDIT input video: > 15 seconds will be auto-truncated by the upstream from 0 to 15s. If you need other segments, slice the video yourself first
Query Task ResultsVideo generation is an async task that returns a
task_id upon submission. Use the Get Task Status endpoint to query generation progress and results.