VEO3
VEO3 Official Video Generation
- Asynchronous processing mode, returns task ID for subsequent queries
- Supports text-to-video and image-to-video (first frame / first & last frame control)
- Supports 720P and 1080P resolution
- Supports 4/6/8 second video duration
- Supports audio track generation
- Supports person generation policy control
POST
Documentation Index
Fetch the complete documentation index at: https://gccai.heqingsong.uk/llms.txt
Use this file to discover all available pages before exploring further.
Authorization
All API endpoints require Bearer Token authenticationGet your API Key:Visit the API Key Management Page to get your API KeyAdd to the request header:
Request Parameters
Video generation model nameAvailable models:
veo3.1-fast-official- Veo 3.1 Official Fast versionveo3.1-quality-official- Veo 3.1 Official High Quality version
Positive text promptDescribe scenes, actions, styles, etc. in detail for better generation results. English prompts are recommended.Example:
"a golden retriever running on the beach, sunset, cinematic"Negative prompt to exclude unwanted contentExample:
"blurry, low quality, watermark, text"Video duration in secondsRecommended values:
4 / 6 / 8Default: 8Note: Must be a pure number (e.g. 8), do not add quotes, otherwise an error will occurVideo aspect ratioAvailable values:
16:9- Landscape9:16- Portrait
16:9Video resolutionAvailable values:
720p- Standard resolution1080p- High definition4K- Ultra high definition
720pFirst frame image URL for image-to-video generation
Last frame image URL for image-to-video generationUsed with
first_frame_image to control first and last framesRandom seed for reproducing generation resultsValue range:
0 - 4294967295Number of samples to generate (1-4), currently recommended to use
1Default: 1Whether to generate audio track
Person generation policyAvailable values:
allow_adult- Only allow generating adult persons/facesdisallow- Do not allow generating persons or faces
allow_adultImage resize strategy (effective for image-to-video)Available values:
pad- Padding modecrop- Cropping mode
padWhether to enable upstream prompt enhancementDefault:
trueText-to-Video vs Image-to-Video
The system automatically determines the mode based on whether image parameters are provided: no images for text-to-video, images for image-to-video.| Parameter | Text-to-Video | Image-to-Video |
|---|---|---|
prompt | Required | Required |
first_frame_image | Not used | Required (at least one) |
last_frame_image | Not used | Optional (last frame) |
negative_prompt | Optional | Optional |
duration | Optional | Optional |
aspect_ratio | Optional | Optional |
resolution | Optional | Optional |
seed | Optional | Optional |
generate_audio | Optional | Optional |
person_generation | Optional | Optional |
resize_mode | Not applicable | Optional |
enhance_prompt | Optional | Optional |
Response
Response status code, 200 on success
Response data array
Usage Scenarios
Scenario 1: Text-to-Video (Basic)
Scenario 2: Text-to-Video (Full Parameters)
Scenario 3: Image-to-Video (Single First Frame)
Scenario 4: Image-to-Video (First Frame + Last Frame)
Scenario 5: Video with Audio
Query Task ResultsVideo generation is an asynchronous task that returns a
task_id upon submission. Use the Get Task Status endpoint to query generation progress and results.