Compatibility Matrix¶

The table below shows every supported model and optimizations supported for them.

The symbols used have the following meanings:

✅ = Full compatibility
❌ = No compatibility
⭕ = Does not apply to this model

Models x Optimization¶

The HuggingFace Model ID can be directly pass to from_pretrained() methods and FastVideo will use the optimal default parameters when initializing and generating videos.

Model Name	HuggingFace Model ID	Resolutions	TeaCache	Sliding Tile Attn	Sage Attn	VSA	BSA
FastWan2.1 T2V 1.3B	`FastVideo/FastWan2.1-T2V-1.3B-Diffusers`	480P	⭕	⭕	⭕	✅	⭕
FastWan2.2 TI2V 5B Full Attn*	`FastVideo/FastWan2.2-TI2V-5B-FullAttn-Diffusers`	720P	⭕	⭕	⭕	✅	⭕
Wan2.2 TI2V 5B	`Wan-AI/Wan2.2-TI2V-5B-Diffusers`	720P	⭕	⭕	✅	⭕	⭕
Wan2.2 T2V A14B	`Wan-AI/Wan2.2-T2V-A14B-Diffusers`	480P 720P	❌	❌	✅	⭕	⭕
Wan2.2 I2V A14B	`Wan-AI/Wan2.2-I2V-A14B-Diffusers`	480P 720P	❌	❌	✅	⭕	⭕
HunyuanVideo	`hunyuanvideo-community/HunyuanVideo`	720px1280p 544px960p	❌	✅	✅	⭕	⭕
FastHunyuan	`FastVideo/FastHunyuan-diffusers`	720px1280p 544px960p	❌	✅	✅	⭕	⭕
Wan2.1 T2V 1.3B	`Wan-AI/Wan2.1-T2V-1.3B-Diffusers`	480P	✅	✅*	✅	⭕	⭕
Wan2.1 T2V 14B	`Wan-AI/Wan2.1-T2V-14B-Diffusers`	480P, 720P	✅	✅*	✅	⭕	⭕
Wan2.1 I2V 480P	`Wan-AI/Wan2.1-I2V-14B-480P-Diffusers`	480P	✅	✅*	✅	⭕	⭕
Wan2.1 I2V 720P	`Wan-AI/Wan2.1-I2V-14B-720P-Diffusers`	720P	✅	✅	✅	⭕	⭕
StepVideo T2V	`FastVideo/stepvideo-t2v-diffusers`	768px768px204f 544px992px204f 544px992px136f	❌	❌	✅	⭕	⭕
TurboWan2.1 T2V 1.3B	`loayrashid/TurboWan2.1-T2V-1.3B-Diffusers`	480P	⭕	⭕	⭕	⭕	⭕
TurboWan2.1 T2V 14B	`loayrashid/TurboWan2.1-T2V-14B-Diffusers`	480P, 720P	⭕	⭕	⭕	⭕	⭕
LongCat T2V 13.6B	See note**	480P 720P	❌	❌	❌	⭕	✅
Matrix Game 2.0 Base	`FastVideo/Matrix-Game-2.0-Base-Diffusers`	352x640	⭕	⭕	⭕	⭕	⭕
Matrix Game 2.0 GTA	`FastVideo/Matrix-Game-2.0-GTA-Diffusers`	352x640	⭕	⭕	⭕	⭕	⭕
Matrix Game 2.0 TempleRun	`FastVideo/Matrix-Game-2.0-TempleRun-Diffusers`	352x640	⭕	⭕	⭕	⭕	⭕

Note: Wan2.2 TI2V 5B has some quality issues when performing I2V generation. We are working on fixing this issue.

Special requirements¶

StepVideo T2V¶

The self-attention in text-encoder (step_llm) only supports CUDA capabilities sm_80 sm_86 and sm_90

Sliding Tile Attention¶

Currently only Hopper GPUs (H100s) are supported.

TurboWan2.1 (TurboDiffusion)¶

Uses TurboDiffusionPipeline with RCM scheduler for 1-4 step generation
Requires SLA attention backend: export FASTVIDEO_ATTENTION_BACKEND=SLA_ATTN
Uses guidance_scale=1.0 (no classifier-free guidance)

Matrix Game 2.0¶

Image-to-video game world models with keyboard/mouse control input
Three variants available: Base (universal), GTA, and TempleRun
Each variant has different keyboard dimensions for control inputs