GitHub - Maoshuiyang/FastVideo at 58cfd71b5e734edf9508f6da40c6c7f866246cf3

Name	Name	Last commit message	Last commit date
Latest commit History 49 Commits
assets	assets
demo	demo
docs	docs
fastvideo	fastvideo
scripts	scripts
.gitignore	.gitignore
LICENSE	LICENSE
README.md	README.md
env_setup.sh	env_setup.sh
pyproject.toml	pyproject.toml

Name

Last commit message

Last commit date

FastVideo is an open framework for distilling, training, and inferencing large video diffusion model.

Get 8X diffusion boost for Mochi with FastVideo

What is this?

As state-of-the-art video diffusion models grow in size and sequence length, their become prohibitive to use. For instance, sampling a 5-second 720P video with Hunyuan takes 13 minutes on 4 X A100. FastVideo aim to make large video diffusion models fast to infer and efficient to train, and thus making them more accessible.

We introduce FastMochi and FastHunyuan, distilled versions of the Mochi and Hunyuan video diffusion models. FastMochi achieves high-quality sampling with just 8 inference steps. FastHunyuan maintains sampling quality with only 4 inference steps.

What can I do with FastVideo?

Other than the distilled weight, FastVideo provides a pipeline for training, distilling, and inferencing video diffusion models. Key capabilities include:

Scalable: FastVideo supports FSDP, sequence parallelism, and selective gradient checkpointing. Our code seamlessly scales to 64 GPUs in our test.
Memory Efficient: FastVideo supports LoRA finetuning coupled with precomputed latents and text embeddings for minimal memory usage.
Variable Sequence length: You can finetuning with both image and videos.

Change Log

2024/12/16: FastVideo v0.1 is released.

🔧 Installation

The code is tested on Python 3.10.0 and CUDA 12.1.

./env_setup.sh fastvideo
conda activate fastvideo

🚀 Inference

We recommend using a GPU with 80GB of memory. To run the inference, use the following command:

FastHunyuan

# Download the model weight
python scripts/huggingface/download_hf.py --repo_id=FastVideo/FastHunyuan --local_dir=data/FastHunyuan --repo_type=model
# change the gpu count inside the script
sh scripts/inference/inference_hunyuan.sh

FastMochi

You can use FastMochi

# Download the model weight
python scripts/huggingface/download_hf.py --repo_id=FastVideo/FastMochi-diffusers --local_dir=data/FastMochi-diffusers --repo_type=model
# CLI inference
bash scripts/inference/inference_mochi_sp.sh
# Gradio web dem
python demo/gradio_web_demo.py --model_path data/FastMochi-diffusers --guidance_scale 1.5 --num_frames 163

Distillation

Please refer to the distillation guide.

Finetuning

Please refer to the finetuning guide.

Development Plan

Acknowledgement

We learned and reused code from the following projects: PCM, diffusers, and OpenSoraPlan.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

What is this?

What can I do with FastVideo?

Change Log

🔧 Installation

🚀 Inference

FastHunyuan

FastMochi

Distillation

Finetuning

Development Plan

Acknowledgement

About

Releases

Packages

Languages

License

Maoshuiyang/FastVideo

Folders and files

Latest commit

History

Repository files navigation

What is this?

What can I do with FastVideo?

Change Log

🔧 Installation

🚀 Inference

FastHunyuan

FastMochi

Distillation

Finetuning

Development Plan

Acknowledgement

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages