Compatibility Matrix#

The table below shows every supported model and optimizations supported for them.

The symbols used have the following meanings:

  • ✅ = Full compatibility

  • ❌ = No compatibility

Models x Optimization#

The HuggingFace Model ID can be directly pass to from_pretrained() methods and FastVideo will use the optimal default parameters when initializing and generating videos.

Model Name

HuggingFace Model ID

Resolutions

TeaCache

Sliding Tile Attn

Sage Attn

HunyuanVideo

hunyuanvideo-community/HunyuanVideo

720px1280p
544px960p

FastHunyuan

FastVideo/FastHunyuan-diffusers

720px1280p
544px960p

Wan T2V 1.3B

Wan-AI/Wan2.1-T2V-1.3B-Diffusers

480P

✅*

Wan T2V 14B

Wan-AI/Wan2.1-T2V-14B-Diffusers

480P, 720P

✅*

Wan I2V 480P

Wan-AI/Wan2.1-I2V-14B-480P-Diffusers

480P

✅*

Wan I2V 720P

Wan-AI/Wan2.1-I2V-14B-720P-Diffusers

720P

✅*

StepVideo T2V

FastVideo/stepvideo-t2v-diffusers

768px768px204f
544px992px204f
544px992px136f

Note: there are some known quality issues with Wan2.1 + Sliding Tile Attn. We are working on fixing this issue.

Special requirements#

StepVideo T2V#

  • The self-attention in text-encoder (step_llm) only supports CUDA capabilities sm_80 sm_86 and sm_90

Sliding Tile Attention#

  • Currently only Hopper GPUs (H100s) are supported.