-
This is a list of peer-reviewed representative papers on deep learning dynamics (optimization dynamics of neural networks). The success of deep learning attributes to both network architecture and …
-
[NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.
-
adaptive-inertia-adai Public
[ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Disentangling the Effects of Adaptive Learning Rate and Momentum".
-
Positive-Negative-Momentum Public
[ICML 2021] The official PyTorch Implementations of Positive-Negative Momentum Optimizers.
-
[Neural Computation, MIT Press] The PyTorch Implementation of Variable Optimizers/ Neural Variable Risk Minimization proposed in our Neural Computation paper: Artificial Neural Variability for Deep…