Create Gemini Omni Video
创建 Gemini Omni 视频生成任务。任务创建后,请使用通用视频任务查询接口获取生成状态和结果。
task_id. Use the Query Video Task endpoint to poll task status and read the final video URL from result.video_url.Supported models
| model | Notes |
|---|---|
Gemini-Omini | Standard Gemini Omni — 4/6/8/10s with 720p/1080p/4k |
omni-10s | Fixed 10s, 720p, up to 7 reference images (image or video URLs) |
Omni-Flash-Ext and gemini-omni normalize to Gemini-Omini.
Reference Video & Generation Modes
The standard version ofGemini-Omini supports the following advanced parameters to enable video reference and custom modes:
generation_type(string): Generation mode, supportingframe(first-frame mode, whereimage_urlscan only take 1 image as the first frame of the video) orreference(reference mode, whereimage_urlscan accept 1 or 3 reference images). Defaults toreference.video_urls(array): A list of reference video URLs (currently supports up to 1 video). Note:durationis not required when uploading a reference video, anddurationandvideo_urlscannot be passed at the same time.
omni-10s example
omni-10s, duration and resolution in the request are ignored; billing uses the omni-10s model config (10s, 720p).Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
Model name. Gemini-Omini, Omni-Flash-Ext, and gemini-omni normalize to Gemini-Omini; omni-10s is a fixed 10s model (720p, up to 7 reference images).
Gemini-Omini, Omni-Flash-Ext, gemini-omni, omni-10s Text prompt for video generation
Video duration in seconds. Default: 6. Only supports: 4, 6, 8, 10. Note: duration is not required when uploading a reference video, and duration and video_urls cannot be passed at the same time.
Video resolution. Defaults to 720p.
720p, 1080p, 4k Video aspect ratio, for example 16:9 or 9:16. Takes precedence over orientation.
Video orientation. Used only when aspect_ratio is empty: landscape maps to 16:9, portrait maps to 9:16.
landscape, portrait Reference image URL list. omni-10s supports up to 7 images.
Compatibility field for common OpenAPI image inputs; used when image_urls is empty. omni-10s supports up to 7 images.
Generation type. frame: first frame mode (image_urls can only accept 1 image as the first frame of the video); reference: reference mode (image_urls can accept 1 or 3 images as style/subject reference). Defaults to reference.
frame, reference Reference video URL list (currently supports up to 1 video). Note: duration is not required when uploading a reference video, and duration and video_urls cannot be passed at the same time.