Submit an asynchronous video generation task using the Videos model series. The endpoint returns a taskDocumentation Index
Fetch the complete documentation index at: https://docs.xingchaoyiqing.com/llms.txt
Use this file to discover all available pages before exploring further.
id immediately — you then poll the Query Video Task endpoint until the task reaches completed or failed. The Videos series supports pure text-to-video, first/last frame guidance, and a powerful multi-modal reference mode combining image and video inputs.
Base URL
Endpoint
Authentication
Models
| Model | Billing | Duration | Resolution | Best For |
|---|---|---|---|---|
videos | Per call | 4–15s | 720p | Standard quality output |
videos_fast | Per call | 4–15s | 720p | Speed-sensitive workflows |
Request Parameters
The model to use for generation.Supported values:
videos— standard modelvideos_fast— faster generation
Text description of the video content. Maximum 1500 characters.Include subject, action, camera movement, and visual style. Avoid content that violates platform policies (e.g., restricted, infringing, or sensitive subject matter).Example:
"A cute kitten playing on a sunlit meadow, cinematic lens"Output video duration in seconds. Required.
- Supported range:
4–15seconds
Output aspect ratio. Defaults to
16:9.| Value | Description |
|---|---|
16:9 | Landscape (default) |
9:16 | Portrait |
1:1 | Square |
Output resolution. Currently only
720p is supported.Default: 720pFirst / Last Frame Mode
Use these fields to guide the video’s starting and ending frames. Both fields are required together.URL of the first-frame image.
- Required to be used with
last_image - Cannot be combined with
referenceImagesorreferenceVideos
URL of the last-frame image.
- Required to be used with
first_image - Cannot be combined with
referenceImagesorreferenceVideos
Reference Media Mode
Use these fields to provide reference images and/or videos that guide the generated output. They can be used individually or combined.Array of reference image URLs.
- Maximum 4 images
- Each image must be under 20 MB
- Can be combined with
referenceVideosfor mixed-media reference - Cannot be used with
first_imageorlast_image
Array of reference video URLs.
- Maximum 3 videos
- Total combined duration must not exceed 15 seconds
- Total combined size must not exceed 200 MB
- Each video must be between 720px and 2160px in resolution
- Can be combined with
referenceImages - Cannot be used with
first_imageorlast_image
Legacy Compatibility
The legacy fields
image (alias for first_image) and lastFrameImage (alias for last_image) are still supported for backwards compatibility.If you pass both a legacy field and its new counterpart with conflicting values, the API will return a parameter conflict error. When both are present with the same value, first_image/last_image take precedence.Response Fields
Unique task identifier. Save this value — you’ll use it to poll the Query Video Task endpoint.
Object type. Always
"video".Unix timestamp of when the task was created.
The model name used for this task.
Task status at creation. Always
"queued" on successful submission.Lifecycle values:queued— task accepted and waiting in queueprocessing— model is actively generatingcompleted— generation finished;video_urlis availablefailed— generation failed; seeerror
Error message.
null on successful submission.Generation Modes
| Mode | Fields Required | Notes |
|---|---|---|
| Text-to-video | model, prompt, duration | Optional: ratio, resolution |
| First/last frame | model, prompt, duration, first_image, last_image | Both frame fields required together |
| Image reference | model, prompt, duration, referenceImages | Up to 4 images, each < 20 MB |
| Video reference | model, prompt, duration, referenceVideos | Up to 3 videos, ≤15s total |
| Image + video | model, prompt, duration, referenceImages, referenceVideos | Combined constraints apply |
Code Examples
Example Response
Next Steps
After receiving the taskid, poll the Query Video Task endpoint to check status and retrieve video_url when generation completes.