Pricing

API pricing for video generation

Video generation is billed per second of output video. Higher resolution and premium models have proportionally higher costs.

Text-to-Video

Generate videos from text prompts. Pricing is based on the duration of the generated video in seconds.

EndpointModelResolutionCost per second
v1/text-to-videoltx-2-fast1920x1080$0.04
2560x1440$0.08
3840x2160$0.16
ltx-2-pro1920x1080$0.06
2560x1440$0.12
3840x2160$0.24

Image-to-Video

Generate videos from an input image. Pricing is based on the duration of the generated video in seconds.

EndpointModelResolutionCost per second
v1/image-to-videoltx-2-fast1920x1080$0.04
2560x1440$0.08
3840x2160$0.16
ltx-2-pro1920x1080$0.06
2560x1440$0.12
3840x2160$0.24

Audio-to-Video

Generate videos synchronized to audio input. Pricing is based on the duration of the input audio in seconds.

EndpointModelResolutionCost per second
v1/audio-to-videoltx-2-pro1920x1080$0.10

Retake (Video Editing)

Edit and regenerate portions of an existing video. Pricing is based on the duration of the input video.

EndpointModelResolutionCost per second
v1/retakeltx-2-pro1920x1080$0.10

Extend (Video Extension)

Extend a video by generating additional frames at the beginning or end. Pricing is based on the duration of the extended portion plus context frames from the input video, capped at a total of 505 billed frames. The resulting billed seconds depend on the input video’s frame rate (~21 seconds at 24fps).

EndpointModelResolutionCost per second
v1/extendltx-2-pro1920x1080$0.10