WIP: This repository contains an unofficial implementation of the research paper Analyzing and Improving the Training Dynamics of Diffusion Models by Tero Karras, Janne Hellsten, Miika Aittala, Timo Aila, Jaakko Lehtinen, and Samuli Laine (NVIDIA and Aalto University). The paper addresses challenges in training the ADM diffusion model architecture, specifically uncontrolled magnitude changes and imbalances in network activations and weights during training. This repo is a sample implementation of the paper using pixel-space diffusion rather than latent diffusion.
- Modification of network layers: The paper proposes a systematic redesign of the network layers to preserve the expected magnitudes of activations, weights, and updates, yielding significantly better networks without increasing computational complexity.
- Magnitude-preserving mechanisms: Several configurations, such as magnitude-preserving learned layers (Config D) and magnitude-preserving fixed-function layers (Config G), are introduced to control activation magnitudes and keep the network balanced. This repo does not implement the magnitude-preserving fixed-function layers (Config G), as they appear to be tuned for latent diffusion. A sketch of a magnitude-preserving layer is given after this list.
- Post-hoc EMA technique: A novel method for selecting the exponential moving average (EMA) profile after training. This allows precise tuning of the EMA length and reveals its interactions with network architecture, training time, and guidance. A sketch of the underlying power-function EMA also follows below.
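To make Config D concrete, here is a minimal sketch of a magnitude-preserving linear layer in PyTorch. The names `normalize` and `MPLinear` are illustrative and do not necessarily match this repo's code; the idea, following the paper, is that the stored weights are forcibly re-normalized at every training step, and a fixed fan-in scaling preserves activation magnitudes on expectation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def normalize(w, eps=1e-4):
    # Scale each output unit's weight vector to unit L2 norm.
    norm = torch.linalg.vector_norm(w, dim=list(range(1, w.ndim)), keepdim=True)
    return w / norm.clamp_min(eps)

class MPLinear(nn.Module):
    """Magnitude-preserving linear layer (Config D style, illustrative)."""

    def __init__(self, in_features, out_features):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(out_features, in_features))

    def forward(self, x):
        if self.training:
            # Forced weight normalization: rewrite the stored weights under
            # no_grad so their magnitudes cannot drift during training.
            with torch.no_grad():
                self.weight.copy_(normalize(self.weight))
        # Standard weight normalization for the forward pass, plus a fixed
        # fan-in scaling to preserve activation magnitudes on expectation.
        w = normalize(self.weight) / (self.weight[0].numel() ** 0.5)
        return F.linear(x, w)
```

The same recipe extends to convolutions by normalizing each output filter as a whole.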
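The post-hoc EMA technique builds on a power-function EMA whose decay beta_t = (1 - 1/t)^(gamma + 1) makes the averaging window grow in proportion to training time. Snapshots of two such averages (the paper suggests gamma values of about 16.97 and 6.94) are saved periodically and combined by least squares after training to synthesize arbitrary EMA lengths. Below is a minimal sketch of the training-time update only, with illustrative names; the post-hoc least-squares reconstruction is not shown.

```python
import copy
import torch

class PowerFunctionEMA:
    """Power-function EMA: beta_t = (1 - 1/t) ** (gamma + 1).

    Unlike a constant-decay EMA, the averaging window widens with
    training time, which is what makes post-hoc reconstruction from a
    small number of stored snapshots possible.
    """

    def __init__(self, model, gamma=6.94):
        self.gamma = gamma
        self.ema = copy.deepcopy(model).eval().requires_grad_(False)

    @torch.no_grad()
    def update(self, model, step):
        # step is 1-indexed; at step 1, beta = 0 and the EMA copies the model.
        beta = (1.0 - 1.0 / step) ** (self.gamma + 1.0)
        for p_ema, p in zip(self.ema.parameters(), model.parameters()):
            p_ema.lerp_(p, 1.0 - beta)
```

Saving `self.ema` at regular intervals for two gamma values produces the snapshots that the paper's post-hoc step combines into any target EMA profile.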
Install dependencies:

```bash
pip install -r requirements.txt
```

Launch training:

```bash
accelerate launch train.py
```

Generate samples from a saved checkpoint:

```bash
python generate.py CHECKPOINT
```
Citation:

```bibtex
@article{karras2023analyzing,
  title={Analyzing and Improving the Training Dynamics of Diffusion Models},
  author={Karras, Tero and Hellsten, Janne and Aittala, Miika and Aila, Timo and Lehtinen, Jaakko and Laine, Samuli},
  journal={arXiv preprint arXiv:2312.02696},
  year={2023}
}
```