980+ Stars40+ ForksOpen Source

TurboDiffusion:
100-200x Faster Video Diffusion

Generate 5-second videos in under 2 seconds on a single RTX 5090. Powered by SageAttention, SLA, and rCM from Tsinghua University.

View on GitHub

From Tsinghua University Machine Learning Group • Read the Paper

Side-by-side Comparison
Original184s
Original video generation
TurboDiffusion1.9s
TurboDiffusion accelerated generation
97x Faster with identical quality
100-200x
Speed Improvement
End-to-end acceleration
1.9s
5-sec Video (1.3B)
480p on RTX 5090
24s
5-sec Video (14B)
720p on RTX 5090
4
Available Models
T2V & I2V

Available Models

Four pre-trained models on HuggingFace for Text-to-Video and Image-to-Video generation

Text-to-Video

TurboWan2.1-T2V-1.3B-480P

480p

Fastest model for quick iterations and prototyping

1.9sE2E Time
184sOriginal
View on HuggingFace
Text-to-Video

TurboWan2.1-T2V-14B-480P

480p

Higher quality with 14B parameters

9.9sE2E Time
1676sOriginal
View on HuggingFace
Text-to-Video

TurboWan2.1-T2V-14B-720P

720p

Best quality for 720p text-to-video generation

24sE2E Time
4767sOriginal
View on HuggingFace
Image-to-Video

TurboWan2.2-I2V-A14B-720P

720p

Transform any image into high-quality video

38sE2E Time
4549sOriginal
View on HuggingFace

Generated Examples

Real comparisons between original Wan2.1 and TurboDiffusion-accelerated generation

Original184s
Original generation
TurboDiffusion1.9s
TurboDiffusion generation
97x faster1.3B-480P

A stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage

Original184s
Original generation
TurboDiffusion1.9s
TurboDiffusion generation
97x faster1.3B-480P

Dynamic ocean waves crashing against rocky cliffs at sunset

Original184s
Original generation
TurboDiffusion1.9s
TurboDiffusion generation
97x faster1.3B-480P

Cinematic aerial view of a futuristic city with flying vehicles

Original184s
Original generation
TurboDiffusion1.9s
TurboDiffusion generation
97x faster1.3B-480P

A majestic eagle soaring through mountain peaks in golden hour light

Original184s
Original generation
TurboDiffusion1.9s
TurboDiffusion generation
97x faster1.3B-480P

Underwater scene with colorful coral reef and tropical fish

Original184s
Original generation
TurboDiffusion1.9s
TurboDiffusion generation
97x faster1.3B-480P

Timelapse of blooming flowers in a spring garden

720P High Resolution Examples (14B Model)

Original4767s
Original 720p generation
TurboDiffusion24s
TurboDiffusion 720p generation
199x faster14B-720P
Original4767s
Original 720p generation
TurboDiffusion24s
TurboDiffusion 720p generation
199x faster14B-720P
Original4767s
Original 720p generation
TurboDiffusion24s
TurboDiffusion 720p generation
199x faster14B-720P

Technical Highlights

Combining three key innovations to achieve unprecedented acceleration

SageAttention Integration

Efficient 8-bit attention with plug-and-play inference acceleration for faster processing

Sparse-Linear Attention (SLA)

Fine-tunable sparse attention that goes beyond traditional sparsity in diffusion transformers

rCM Timestep Distillation

Score-regularized continuous-time consistency for high-quality few-step generation

Single GPU Friendly

Run on consumer hardware like RTX 4090/5090 with quantized checkpoints

Easy Integration

Simple pip install with ComfyUI support. Works with existing Wan model ecosystem

Flexible Quality/Speed

Adjustable parameters for 1-4 step sampling with configurable sigma for quality control

Ready to accelerate your video generation?

TurboDiffusion is fully open source under Apache-2.0 license. Start generating videos 100-200x faster today.