Generate a video from a text prompt. Describe the scene, camera motion, and mood, and the model returns a video with matching audio.
Returns the generated video directly in the response. For an asynchronous version that returns a job to poll, see [v2/text-to-video](/api-documentation/api-reference/async-video-generation/submit-text-to-video).
Billed per second of generated video. See [Pricing](/pricing).
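As a rough illustration of per-second billing, the sketch below multiplies a duration by a per-second rate. The function name `estimate_cost` and the rate used in the usage note are hypothetical, and the sketch assumes billing is linear in duration with no rounding or minimum charge; the actual rates and rules are on the Pricing page.

```python
from decimal import Decimal


def estimate_cost(duration_seconds: int, price_per_second: Decimal) -> Decimal:
    """Estimate the charge for one generation.

    Assumes cost = duration * rate, which is an illustration only;
    the real rate per model and resolution is listed on the Pricing page.
    """
    if duration_seconds <= 0:
        raise ValueError("duration_seconds must be positive")
    # Decimal avoids binary floating-point drift in currency arithmetic.
    return Decimal(duration_seconds) * price_per_second
```

For example, `estimate_cost(6, Decimal("0.05"))` gives the estimate for a 6-second video at a placeholder rate of 0.05 per second.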
Request
This endpoint expects an object.
prompt (string, required, <=5000 characters)
Text prompt describing the desired video content
duration (integer, required)
Video duration in seconds. See Supported Models for available durations per model.
resolution (string, required)
Output video resolution. See Supported Models for available resolutions per model.
fps (integer, optional, defaults to 24)
Frame rate in frames per second. See Supported Models for available FPS per model and resolution.
generate_audio (boolean, optional, defaults to true)
Whether to generate audio for the video. When true, the video includes AI-generated audio matching the scene. When false, the video is silent.
camera_motion (enum, optional)
Apply camera motion effects to the generated video.
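The request object described above can be assembled as in the following minimal sketch. The helper name `build_payload` is hypothetical, and the client-side checks only mirror what this page states (required fields, the 5000-character prompt limit, and the defaults for `fps` and `generate_audio`). Valid durations, resolutions, FPS values, and `camera_motion` values are not validated here because they depend on the model; see Supported Models.

```python
from typing import Any, Dict, Optional

MAX_PROMPT_CHARS = 5000  # prompt limit stated in the parameter list


def build_payload(
    prompt: str,
    duration: int,
    resolution: str,
    fps: int = 24,
    generate_audio: bool = True,
    camera_motion: Optional[str] = None,
) -> Dict[str, Any]:
    """Build the JSON-serializable request object (hypothetical helper)."""
    if not prompt:
        raise ValueError("prompt is required")
    if len(prompt) > MAX_PROMPT_CHARS:
        raise ValueError(f"prompt exceeds {MAX_PROMPT_CHARS} characters")
    payload: Dict[str, Any] = {
        "prompt": prompt,
        "duration": duration,
        "resolution": resolution,
        "fps": fps,
        "generate_audio": generate_audio,
    }
    # camera_motion is optional; omit the key entirely when it is not set.
    if camera_motion is not None:
        payload["camera_motion"] = camera_motion
    return payload
```

A call such as `build_payload("A fox runs through fresh snow at dawn", 6, "1920x1080")` uses placeholder values for duration and resolution; check them against the model you target before sending.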
Response
Video generated successfully
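Because the video comes back directly in the response, a client can write the response body straight to disk. The sketch below uses only the Python standard library. The URL in the usage note, the Bearer authorization scheme, and the assumption that the body is the raw video file are all illustrative and not confirmed by this page; the function names are hypothetical.

```python
import json
import urllib.request
from typing import Any, Dict


def build_request(url: str, api_key: str, payload: Dict[str, Any]) -> urllib.request.Request:
    """Prepare a JSON POST request (auth scheme is an assumption)."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed scheme
        },
        method="POST",
    )


def generate_video(
    url: str,
    api_key: str,
    payload: Dict[str, Any],
    out_path: str,
    timeout: float = 600.0,
) -> int:
    """Send the request and save the response body; returns bytes written.

    Assumes the response body is the video file itself. urlopen raises
    urllib.error.HTTPError on non-2xx responses.
    """
    req = build_request(url, api_key, payload)
    # Generation is synchronous, so allow a generous timeout.
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        data = resp.read()
    with open(out_path, "wb") as f:
        f.write(data)
    return len(data)
```

Usage might look like `generate_video("https://api.example.com/text-to-video", "YOUR_API_KEY", {"prompt": "A fox runs through fresh snow", "duration": 6, "resolution": "1920x1080"}, "out.mp4")`, where the URL, duration, resolution, and file extension are placeholders.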