GitHub - lucidrains/adam-atan2-pytorch at 0.0.3

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.github/workflows		.github/workflows
adam_atan2_pytorch		adam_atan2_pytorch
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
adam-atan2.png		adam-atan2.png
pyproject.toml		pyproject.toml

Repository files navigation

Adam-atan2 - Pytorch

Implementation of the proposed Adam-atan2 optimizer in Pytorch

A multi-million dollar paper out of google deepmind proposes a small change to Adam update rule (using atan2) to remove the epsilon altogether for numerical stability and scale invariance

Install

$ pip install adam-atan2-pytorch

Usage

# toy model

import torch
from torch import nn

model = nn.Linear(10, 1)

# import AdamAtan2 and instantiate with parameters

from adam_atan2_pytorch import AdamAtan2

opt = AdamAtan2(model.parameters(), lr = 1e-4)

# forward and backwards

for _ in range(100):
  loss = model(torch.randn(10))
  loss.backward()

  # optimizer step

  opt.step()
  opt.zero_grad()

Todo

foreach version

Citations

@inproceedings{Everett2024ScalingEA,
    title   = {Scaling Exponents Across Parameterizations and Optimizers},
    author  = {Katie Everett and Lechao Xiao and Mitchell Wortsman and Alex Alemi and Roman Novak and Peter J. Liu and Izzeddin Gur and Jascha Narain Sohl-Dickstein and Leslie Pack Kaelbling and Jaehoon Lee and Jeffrey Pennington},
    year    = {2024},
    url     = {https://api.semanticscholar.org/CorpusID:271051056}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Adam-atan2 - Pytorch

Install

Usage

Todo

Citations

About

Releases 28

Packages

Languages

License

lucidrains/adam-atan2-pytorch

Folders and files

Latest commit

History

Repository files navigation

Adam-atan2 - Pytorch

Install

Usage

Todo

Citations

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 28

Packages 0

Languages

Packages