Capability Matrix

This page compares PixVerse generation models and standalone capabilities.

Use this matrix to check which model or endpoint supports your workflow before integrating the API.

Generation Model Capabilities

These capabilities require choosing a generation model, such as C1 or V6.

Capability	C1	V6	Legacy models
Text-to-video	Supported	Supported	Varies
Image-to-video	Supported	Supported	Varies
Transition(first and last frame)	Supported	Supported	Varies
Reference-to-video / Fusion	Supported	Supported	Varies
Video extension	Not supported	Supported	Varies
Inline Multi-clip generation	Supported with prompt	Supported with parameters	v5.5 above
Inline audio generation	Supported	Supported	v5.5 above
Max duration	15s	15s	Varies
Max resolution	1080p	1080p	1080p
Pricing	Per second	Per second	Varies

Standalone Capabilities

These capabilities are available through dedicated endpoints. They do not require selecting C1 or V6 as the generation model unless the endpoint documentation says otherwise.

Capability	Endpoint type	What it does
Restyle	Restyle	Change the visual style of an existing video
Swap	Swap & Swam-mask	Replace a subject or region in a video
Mimic	Motion control	Replicate motion from a reference source
Modify	Modify	Edit video content using a text prompt
Sound Effects	Sound effects generation	Generate synchronized audio for a video
Lip Sync	Lip sync	Align speech to mouth movement in a video
Image Template	Image template generation	Generate images from predefined templates

For full parameter schemas, supported input formats, pricing, and limitations, see the API Reference.

Capability matrix

Capability Matrix#

Generation Model Capabilities#

Standalone Capabilities#

Capability Matrix

Generation Model Capabilities

Standalone Capabilities