Submit an asynchronous video generation task using the SD2 Manxue model family. The endpoint returns a task ID immediately; poll the Query Video Task endpoint until the task reaches completed or failed. SD2 Manxue supports pure text-to-video, first-frame and first/last frame guidance, and a multi-modal reference mode that accepts images, videos, and audio together. Models range from 720p to 4K output resolution, with Fast variants for speed-sensitive workflows.
Base URL
Endpoint
Authentication
Models
Select the model based on your desired output resolution and speed requirements:

| Model | Output Resolution | Notes |
|---|---|---|
| sd2_manxue_720p | 720p | Standard quality, faster generation |
| sd2_manxue_1080p | 1080p | Full HD, general-purpose |
| sd2_manxue_2k | 2K | Higher-precision output |
| sd2_manxue_4k | 4K | Maximum quality, highest detail |
| sd2_manxue_fast_720p | 720p | Fast series, speed-first at 720p |
| sd2_manxue_fast_1080p | 1080p | Fast series, speed-first at 1080p |
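The model table can be encoded as a small lookup so that callers choose by resolution and speed preference instead of hard-coding names. This is a minimal sketch: the model names come from the table above, while `MODELS` and `pick_model` are hypothetical helpers and not part of the API.

```python
# Model names are copied from the table above; the lookup itself is illustrative.
MODELS = {
    ("720p", False): "sd2_manxue_720p",
    ("1080p", False): "sd2_manxue_1080p",
    ("2k", False): "sd2_manxue_2k",
    ("4k", False): "sd2_manxue_4k",
    ("720p", True): "sd2_manxue_fast_720p",
    ("1080p", True): "sd2_manxue_fast_1080p",
}


def pick_model(resolution: str, fast: bool = False) -> str:
    """Return the model name for a resolution, optionally from the Fast series."""
    key = (resolution.lower(), fast)
    if key not in MODELS:
        series = "Fast" if fast else "standard"
        raise ValueError(f"no {series} model for resolution {resolution!r}")
    return MODELS[key]
```

For example, `pick_model("1080p", fast=True)` resolves to the Fast 1080p variant, and asking for a Fast 4K model raises `ValueError` because the table lists none.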
Request Parameters
prompt
Text description of the video content. Include subject, action, camera movement, and visual style. Example:
"A woman jogging on a neon-lit street at night, rain reflections, smooth camera push, cinematic"

model
The model to use. Determines output resolution. See the model table above. Supported values:
sd2_manxue_720p, sd2_manxue_1080p, sd2_manxue_2k, sd2_manxue_4k, sd2_manxue_fast_720p, sd2_manxue_fast_1080p

duration
Output video duration in seconds.
- Default: 5
- Supported range: 4–15 seconds

ratio
Output aspect ratio. Defaults to 16:9.

| Value | Description |
|---|---|
| 16:9 | Landscape (default) |
| 21:9 | Ultrawide |
| 4:3 | Standard screen |
| 1:1 | Square |
| 3:4 | Portrait standard |
| 9:16 | Portrait |
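A text-to-video request body can be checked on the client before it is submitted. The sketch below assumes the field names model, prompt, duration, and ratio as they appear in the Generation Modes table; `build_text_to_video_payload` is a hypothetical helper, and the server remains the authority on what is accepted.

```python
SUPPORTED_MODELS = {
    "sd2_manxue_720p", "sd2_manxue_1080p", "sd2_manxue_2k",
    "sd2_manxue_4k", "sd2_manxue_fast_720p", "sd2_manxue_fast_1080p",
}
SUPPORTED_RATIOS = {"16:9", "21:9", "4:3", "1:1", "3:4", "9:16"}


def build_text_to_video_payload(prompt, model, duration=5, ratio="16:9"):
    """Validate the documented constraints and return a request body dict."""
    if not prompt or not prompt.strip():
        raise ValueError("prompt must not be empty")
    if model not in SUPPORTED_MODELS:
        raise ValueError(f"unsupported model: {model!r}")
    # Documented range is 4-15 seconds, default 5.
    if not 4 <= duration <= 15:
        raise ValueError("duration must be between 4 and 15 seconds")
    if ratio not in SUPPORTED_RATIOS:
        raise ValueError(f"unsupported ratio: {ratio!r}")
    return {"model": model, "prompt": prompt, "duration": duration, "ratio": ratio}
```

Calling it with only a prompt and a model yields a body that uses the documented defaults of 5 seconds and 16:9.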
First / Last Frame Mode
first_image
URL of the first-frame image.
- Used alone: drives the video from this starting frame
- Used with last_image: first/last frame guidance mode
- Accepts a public URL or asset://{assetId} for assets submitted via the asset library
- Recommended image aspect ratio: 0.4–2.5; recommended size: 300–6000 px
- Cannot be used with referenceImages, referenceVideos, or referenceAudios

last_image
URL of the last-frame image. Enables first/last frame guidance when paired with first_image.
- Must be used together with first_image; cannot be used alone
- Recommended image aspect ratio: 0.4–2.5; recommended size: 300–6000 px
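The frame rules above can be sketched as two client-side checks: one for how the fields may be paired, one for the recommended image dimensions. Both helpers are hypothetical. The dimension check assumes that "size" refers to each side of the image in pixels. The direction of the aspect ratio does not matter, because the 0.4–2.5 range gives the same result for width/height and height/width.

```python
REFERENCE_FIELDS = ("referenceImages", "referenceVideos", "referenceAudios")


def check_frame_fields(payload: dict) -> None:
    """Raise ValueError if first_image / last_image are combined incorrectly."""
    has_first = bool(payload.get("first_image"))
    has_last = bool(payload.get("last_image"))
    if has_last and not has_first:
        raise ValueError("last_image must be used together with first_image")
    if has_first and any(payload.get(field) for field in REFERENCE_FIELDS):
        raise ValueError("first_image cannot be combined with reference* fields")


def within_recommendations(width: int, height: int) -> bool:
    """Check an image against the recommended aspect ratio and pixel size."""
    aspect = width / height
    sides_ok = min(width, height) >= 300 and max(width, height) <= 6000
    return 0.4 <= aspect <= 2.5 and sides_ok
```

Since the limits are stated as recommendations, `within_recommendations` returns a boolean that a caller might log as a warning instead of treating it as a hard failure.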
Multi-Modal Reference Mode
Use these fields to provide reference media for the generated video. All three can be combined freely, but none can be combined with first_image or last_image.

referenceImages
Array of reference image URLs (or asset://{assetId} references).
- Maximum 9 images
- Accepted formats: jpeg, png, webp, bmp, tiff, gif
- Recommended aspect ratio: 0.4–2.5; recommended dimensions: 300–6000 px
- Cannot be used with first_image or last_image

referenceVideos
Array of reference video URLs.
- Maximum 3 videos
- Total combined duration must not exceed 15 seconds
- Total combined size must not exceed 500 MB
- Cannot be used with first_image or last_image

referenceAudios
Array of reference audio URLs.
- Maximum 3 audio files
- Total combined duration must not exceed 15 seconds
- Total combined size must not exceed 500 MB
- Cannot be used with first_image or last_image
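The count, duration, and size limits for reference media can be checked before upload if the caller already knows each file's length and size. The sketch below is illustrative only: `Clip` and `check_references` are hypothetical names, the metadata must be supplied by the caller, and 500 MB is assumed to mean 500 × 1024 × 1024 bytes, which the documentation does not specify.

```python
from dataclasses import dataclass

MAX_IMAGES = 9
MAX_CLIPS = 3
MAX_TOTAL_SECONDS = 15
MAX_TOTAL_BYTES = 500 * 1024 * 1024  # assumption: binary megabytes


@dataclass
class Clip:
    url: str
    seconds: float
    size_bytes: int


def _check_clips(field: str, clips: list) -> list:
    """Return a list of limit violations for one video or audio field."""
    errors = []
    if len(clips) > MAX_CLIPS:
        errors.append(f"{field}: more than {MAX_CLIPS} files")
    if sum(c.seconds for c in clips) > MAX_TOTAL_SECONDS:
        errors.append(f"{field}: total duration exceeds {MAX_TOTAL_SECONDS}s")
    if sum(c.size_bytes for c in clips) > MAX_TOTAL_BYTES:
        errors.append(f"{field}: total size exceeds 500 MB")
    return errors


def check_references(images: list, videos: list, audios: list) -> list:
    """Collect every documented limit that the reference media would violate."""
    errors = []
    if len(images) > MAX_IMAGES:
        errors.append(f"referenceImages: more than {MAX_IMAGES} images")
    errors += _check_clips("referenceVideos", videos)
    errors += _check_clips("referenceAudios", audios)
    return errors
```

Returning a list of messages rather than raising on the first problem lets a caller report all violations at once.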
For images containing faces or virtual avatars, submit the image to the asset library first and reference it using the asset://{assetId} format to ensure optimal results and policy compliance.
Legacy Compatibility
The legacy fields image (alias for first_image) and lastFrameImage (alias for last_image) remain supported. If both a legacy field and its new counterpart are provided with conflicting values, the API returns a parameter conflict error.
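A client that still receives legacy field names can translate them before sending, and detect the conflict case locally rather than waiting for the API error. This is a minimal sketch of that idea; `normalize_legacy_fields` is a hypothetical helper, and the server-side behavior is only what the paragraph above describes.

```python
# Legacy name -> current name, as described above.
LEGACY_ALIASES = {"image": "first_image", "lastFrameImage": "last_image"}


def normalize_legacy_fields(payload: dict) -> dict:
    """Return a copy of payload with legacy field names replaced by current ones."""
    result = {k: v for k, v in payload.items() if k not in LEGACY_ALIASES}
    for old, new in LEGACY_ALIASES.items():
        if old not in payload:
            continue
        # Mirror the documented rule: differing values for a pair are a conflict.
        if new in result and result[new] != payload[old]:
            raise ValueError(f"conflicting values for {old!r} and {new!r}")
        result[new] = payload[old]
    return result
```

Passing both names with the same value is treated as harmless here; only differing values raise.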
Response Fields
Unique task identifier. Save this value; you will use it to poll the Query Video Task endpoint.
Object type. Always "video".
Unix timestamp of when the task was created.
The model name used for this task.
Task status at creation. Always "queued" on successful submission. Lifecycle values:
- queued: task accepted and waiting in queue
- processing: model is actively generating
- completed: generation finished; video_url is available
- failed: generation failed; see error
Error message. null on successful submission.
Generation Modes
| Mode | Required Fields | Constraints |
|---|---|---|
| Text-to-video | model, prompt | Optional: duration, ratio |
| First-frame | model, prompt, first_image | Optional: duration, ratio |
| First/last frame | model, prompt, first_image, last_image | Optional: duration, ratio |
| Image reference | model, prompt, referenceImages | Up to 9 images |
| Video reference | model, prompt, referenceVideos | Up to 3 videos, ≤15s total |
| Audio reference | model, prompt, referenceAudios | Up to 3 files, ≤15s total |
| Multi-modal | model, prompt + any combination of referenceImages, referenceVideos, referenceAudios | All reference* can be combined |
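The table can be read as a decision procedure over which optional fields are present. The sketch below classifies a request body into one of the mode names from the table. `detect_mode` is a hypothetical helper, and it assumes the body already satisfies the combination rules, so it does not validate them again.

```python
SINGLE_REFERENCE_MODES = {
    "referenceImages": "image reference",
    "referenceVideos": "video reference",
    "referenceAudios": "audio reference",
}


def detect_mode(payload: dict) -> str:
    """Name the generation mode implied by the fields present in payload."""
    if payload.get("first_image"):
        return "first/last frame" if payload.get("last_image") else "first-frame"
    used = [field for field in SINGLE_REFERENCE_MODES if payload.get(field)]
    if len(used) > 1:
        return "multi-modal"
    if used:
        return SINGLE_REFERENCE_MODES[used[0]]
    return "text-to-video"
```

A body with only model and prompt is classified as text-to-video, while any two or more reference* fields together count as multi-modal.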
Code Examples
Example Response
Next Steps
After receiving the task ID, poll the Query Video Task endpoint to check status and retrieve video_url when generation completes.
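The polling step can be sketched as a loop that stops on either terminal status. Because the Query Video Task request itself is documented elsewhere, the sketch takes the HTTP call as a caller-supplied function. `wait_for_task` and `fetch_task` are hypothetical names, and the `status` key is an assumption about the shape of the query response.

```python
import time
from typing import Callable

TERMINAL_STATUSES = {"completed", "failed"}


def wait_for_task(
    task_id: str,
    fetch_task: Callable[[str], dict],
    interval: float = 5.0,
    timeout: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Poll fetch_task(task_id) until the task is completed or failed."""
    deadline = time.monotonic() + timeout
    while True:
        task = fetch_task(task_id)
        # Assumption: the query response exposes the lifecycle value as "status".
        if task.get("status") in TERMINAL_STATUSES:
            return task
        if time.monotonic() >= deadline:
            raise TimeoutError(f"task {task_id} did not finish within {timeout}s")
        sleep(interval)
```

The interval and timeout defaults are arbitrary placeholders; a caller would tune them to the expected generation time of the chosen model and check the returned task for completed versus failed before reading video_url.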