Grok - Create Task
Create video generation tasks using Grok models for text-to-video and image-to-video use cases.
Authentication
All requests require a Bearer token in the request header:
Authorization: Bearer {{key}}
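
As a minimal sketch, the header above could be built in Python as follows. The helper name `auth_headers` is hypothetical and not part of the API; the only thing taken from the documentation is the `Authorization: Bearer {{key}}` format, where `{{key}}` stands for your own key.

```python
def auth_headers(api_key: str) -> dict:
    """Return the Authorization header in the documented Bearer format.

    `auth_headers` is an illustrative helper, not something the API provides.
    """
    key = api_key.strip()
    if not key:
        raise ValueError("API key must not be empty")
    return {"Authorization": f"Bearer {key}"}
```

Stripping whitespace guards against a key pasted with a trailing newline, which would otherwise produce an invalid header value.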
Base URL
https://zcbservice.aizfw.cn/kyyReactApiServer
`baseUrl` is the shared prefix for all public APIs. The `api` field in the current page frontmatter shows the full endpoint. Use this `baseUrl` as the common prefix when reading or composing request paths.

Model overview
Supported models:
- `grok_video3`: Variable duration from 6 to 30 seconds; supports 480p and 720p; defaults to 10 seconds, 16:9, and 720p
- `grok_video3_pro`: Fixed 10-second output; handled by default as 16:9 and 720p
- `grok_video3_max`: Supports 6 / 10 / 12 / 16 / 20 / 30 seconds; supports 480p and 720p; defaults to 10 seconds, 16:9, and 720p
- `grok_video3_stable`: Fixed 10-second output; supports 720p only; defaults to 16:9

Request parameters
model (body, string, required)
Model name.
Available values:
- `grok_video3`: Variable duration from 6 to 30 seconds
- `grok_video3_pro`: Fixed 10-second output
- `grok_video3_max`: A fixed set of duration choices and stricter image constraints
- `grok_video3_stable`: Fixed 10-second output; supports 720p only

prompt (body, string, required)
Prompt describing the video you want to generate.
Example:
"A cat dancing in the rain, cinematic style"

duration (body, integer)
Video duration in seconds.
Rules:
- `grok_video3`: supports 6-30; default is 10
- `grok_video3_pro`: fixed at 10, so you do not need to send it
- `grok_video3_max`: only supports 6, 10, 12, 16, 20, 30; default is 10
- `grok_video3_stable`: fixed at 10; omit it or send 10
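
The duration rules above can be checked on the client before a task is submitted. The sketch below is one way to do that; `ALLOWED_DURATIONS` and `resolve_duration` are hypothetical names, and treating the 6-30 range as inclusive whole seconds is an assumption based on the parameter being an integer.

```python
from typing import Optional

# Allowed durations per model, in seconds, transcribed from the rules above.
ALLOWED_DURATIONS = {
    "grok_video3": range(6, 31),                 # assumed: any whole second, 6 to 30 inclusive
    "grok_video3_pro": (10,),                    # fixed
    "grok_video3_max": (6, 10, 12, 16, 20, 30),
    "grok_video3_stable": (10,),                 # fixed
}
DEFAULT_DURATION = 10  # documented default for every model


def resolve_duration(model: str, duration: Optional[int] = None) -> int:
    """Return the duration to use, or raise ValueError if it is unsupported."""
    if model not in ALLOWED_DURATIONS:
        raise ValueError(f"unknown model: {model}")
    if duration is None:
        return DEFAULT_DURATION
    if duration not in ALLOWED_DURATIONS[model]:
        raise ValueError(f"{model} does not support a {duration}-second duration")
    return duration
```

Failing locally with a clear message is usually cheaper than waiting for the server to reject the request, especially for `grok_video3_max`, which accepts only a fixed set of values.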

aspect_ratio (body, string)
Output aspect ratio.
Rules:
- `grok_video3`: supports 16:9, 9:16, 1:1, 3:2, 2:3; default is 16:9
- `grok_video3_pro`: defaults to 16:9
- `grok_video3_max`: only supports 16:9, 9:16, 1:1; default is 16:9
- `grok_video3_stable`: supports 16:9, 9:16, 3:2, 2:3, 1:1; default is 16:9
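
When a reference image has arbitrary dimensions, it can help to pick the supported ratio closest to it for the chosen model. The following is a sketch of that idea, not something the API does for you: `closest_aspect_ratio` is a hypothetical helper, and listing only 16:9 for `grok_video3_pro` is an assumption, since the rules above state only its default.

```python
import math

# Supported ratios per model, transcribed from the rules above.
ALLOWED_ASPECT_RATIOS = {
    "grok_video3": ("16:9", "9:16", "1:1", "3:2", "2:3"),
    "grok_video3_pro": ("16:9",),  # assumption: only the documented default
    "grok_video3_max": ("16:9", "9:16", "1:1"),
    "grok_video3_stable": ("16:9", "9:16", "3:2", "2:3", "1:1"),
}


def closest_aspect_ratio(model: str, width: int, height: int) -> str:
    """Return the supported ratio nearest to width:height for the model."""
    if model not in ALLOWED_ASPECT_RATIOS:
        raise ValueError(f"unknown model: {model}")
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    target = math.log(width / height)

    def distance(ratio: str) -> float:
        w, h = ratio.split(":")
        # Compare in log space so 2:1 and 1:2 are equally far from 1:1.
        return abs(math.log(int(w) / int(h)) - target)

    return min(ALLOWED_ASPECT_RATIOS[model], key=distance)
```

For example, a 1200x800 image maps to 3:2 on `grok_video3`, but to 16:9 on `grok_video3_max`, which does not offer 3:2.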

resolution (body, string)
Output resolution.
Rules:
- `grok_video3`: supports 480p and 720p; default is 720p
- `grok_video3_pro`: handled by default as 720p
- `grok_video3_max`: supports 480p and 720p; default is 720p
- `grok_video3_stable`: supports 720p only; omit it or send 720p
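
Because several models let you omit `resolution` entirely, one option is a helper that returns either an empty fragment or a validated field to merge into the request body. This is only a sketch: `resolution_field` is a hypothetical name, and restricting `grok_video3_pro` to 720p is an assumption drawn from "handled by default as 720p".

```python
from typing import Optional

# Supported resolutions per model, transcribed from the rules above.
ALLOWED_RESOLUTIONS = {
    "grok_video3": ("480p", "720p"),
    "grok_video3_pro": ("720p",),  # assumption: only the documented default
    "grok_video3_max": ("480p", "720p"),
    "grok_video3_stable": ("720p",),
}


def resolution_field(model: str, resolution: Optional[str] = None) -> dict:
    """Return a body fragment: {} to use the default, or a checked value."""
    if model not in ALLOWED_RESOLUTIONS:
        raise ValueError(f"unknown model: {model}")
    if resolution is None:
        return {}  # omit the field and let the service apply its default
    if resolution not in ALLOWED_RESOLUTIONS[model]:
        raise ValueError(f"{model} does not support resolution {resolution}")
    return {"resolution": resolution}
```

The fragment can be merged into a larger payload with `payload.update(resolution_field(model, resolution))`.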

image_urls (body, array)
Reference image URL array for image-to-video scenarios.
Rules:
- `grok_video3`: up to 7 reference images
- `grok_video3_pro`: supports reference images
- `grok_video3_max`: up to 5 public URLs
- `grok_video3_stable`: up to 7 public URLs
Example:
["https://example.com/image1.jpg", "https://example.com/image2.jpg"]

Response parameters
id (string)
Unique identifier of the video generation task. Use it to query task status later.
object (string)
Object type. Always `video`.
created (integer)
Task creation timestamp.
model (string)
Model name used to create the task.
status (string)
Task status. On successful creation, it is always `queued`.
error (string)
Error information. It is usually `null` when task creation succeeds.

Parameter notes
Key constraints:
- Omit `image_urls` for text-to-video
- Send `image_urls` for image-to-video
- `grok_video3_max` has the strictest validation for aspect ratio, duration, and resolution, so send only supported values
- `grok_video3_stable` only accepts publicly accessible reference image URLs, not base64 content

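
The constraints above suggest a single payload builder that handles both modes: it leaves out `image_urls` for text-to-video and validates it for image-to-video. The sketch below is illustrative only. `build_payload` is a hypothetical helper, the `http://`/`https://` prefix check is just a rough stand-in for "publicly accessible URL", and no image limit is enforced for `grok_video3_pro` because none is documented.

```python
from typing import Optional, Sequence

# Maximum reference images per model; None means no limit is documented.
MAX_REFERENCE_IMAGES = {
    "grok_video3": 7,
    "grok_video3_pro": None,
    "grok_video3_max": 5,
    "grok_video3_stable": 7,
}
# Models whose rules call for public URLs rather than inline image data.
URL_ONLY_MODELS = {"grok_video3_max", "grok_video3_stable"}


def build_payload(model: str, prompt: str,
                  image_urls: Optional[Sequence[str]] = None) -> dict:
    """Build a request body; image_urls is omitted for text-to-video."""
    if model not in MAX_REFERENCE_IMAGES:
        raise ValueError(f"unknown model: {model}")
    payload = {"model": model, "prompt": prompt}
    if not image_urls:
        return payload  # text-to-video
    limit = MAX_REFERENCE_IMAGES[model]
    if limit is not None and len(image_urls) > limit:
        raise ValueError(f"{model} accepts at most {limit} reference images")
    if model in URL_ONLY_MODELS:
        for url in image_urls:
            # Heuristic only: rejects data: URIs and raw base64 strings.
            if not url.startswith(("http://", "https://")):
                raise ValueError(f"{model} requires public image URLs")
    payload["image_urls"] = list(image_urls)
    return payload
```

Calling `build_payload("grok_video3", "A cat dancing in the rain, cinematic style")` yields a body with only `model` and `prompt`, which matches the text-to-video case.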
Model comparison
| Feature | grok_video3 | grok_video3_pro | grok_video3_max | grok_video3_stable |
|---|---|---|---|---|
| Duration | 6-30 seconds, default 10 | Fixed 10 seconds | 6 / 10 / 12 / 16 / 20 / 30 seconds, default 10 | Fixed 10 seconds |
| Ratio | 16:9 / 9:16 / 1:1 / 3:2 / 2:3 | Default 16:9 | 16:9 / 9:16 / 1:1, default 16:9 | 16:9 / 9:16 / 3:2 / 2:3 / 1:1, default 16:9 |
| Quality | 480p / 720p, default 720p | Default 720p | 480p / 720p, default 720p | 720p only |
| Reference images | Up to 7 | Compatible with reference image input | Up to 5 public URLs | Up to 7 public URLs |
Best practices:
- The create API returns a task object. Save its `id` for follow-up queries.
- Make prompts specific about subject, action, camera movement, and style.
- If you use `grok_video3_max`, prefer high-quality public image URLs.
- If you use `grok_video3_stable`, all reference images must be publicly accessible URLs.
- More reference images usually mean longer queue and generation times.
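
To tie the first practice to the request itself, here is a sketch that composes the create request and extracts the task `id` from the response fields documented above. Several things are assumptions: the helper names are hypothetical, the endpoint path must be supplied by the caller (this page does not show it), and the `POST` method and JSON `Content-Type` are guesses based on the parameters being body fields. The code only builds the request; sending it is left to the caller.

```python
import json
import urllib.request

BASE_URL = "https://zcbservice.aizfw.cn/kyyReactApiServer"


def build_create_request(endpoint_path: str, api_key: str,
                         payload: dict) -> urllib.request.Request:
    """Compose the request; endpoint_path comes from the page's api field."""
    url = BASE_URL.rstrip("/") + "/" + endpoint_path.lstrip("/")
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",  # assumption: JSON body
        },
        method="POST",  # assumption: not stated on this page
    )


def task_id_from_response(body: dict) -> str:
    """Return the task id to save, or raise if creation did not succeed."""
    if body.get("error"):
        raise RuntimeError(f"task creation failed: {body['error']}")
    if body.get("status") != "queued":
        raise RuntimeError(f"unexpected status: {body.get('status')!r}")
    return body["id"]
```

A caller would pass the built request to `urllib.request.urlopen`, decode the JSON body, and store the value returned by `task_id_from_response` for later status queries.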
