PyTorch_OLoptim

A PyTorch implementation of various online / stochastic optimization algorithms.

Descriptions

FTRL: Follow the Regularized Leader

FTML: [ICML 2017] Follow the Moving Leader in Deep Learning

SGDOL: [NeurIPS 2019] Surrogate Losses for Online Learning of Stepsizes in Stochastic Non-Convex Optimization

STORM: [NeurIPS 2019] Momentum-Based Variance Reduction in Non-Convex SGD

EXP3: Exponential-weight algorithm for Exploration and Exploitation

intro: a classic algorithm for the (adversarial) multi-armed bandit problem, implemented here as a learning rate scheduler.
original paper: https://cseweb.ucsd.edu/~yfreund/papers/bandits.pdf
a nice blog post: https://parameterfree.com/2019/11/12/multi-armed-bandit-i/
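
To make the idea concrete, here is a minimal, framework-agnostic sketch of EXP3 choosing among candidate learning rates. It is not the scheduler shipped in this repository: the class name `Exp3LRSelector`, its methods, and the parameter `eta` are hypothetical, and it assumes the feedback is a loss rescaled to [0, 1]. It uses the loss-based form of EXP3 without explicit uniform exploration.

```python
import math
import random


class Exp3LRSelector:
    """Hypothetical helper: treats each candidate learning rate as a bandit arm."""

    def __init__(self, learning_rates, eta=0.1, seed=0):
        self.learning_rates = list(learning_rates)
        self.eta = eta  # step size of the exponential-weights update
        self.cum_loss_est = [0.0] * len(self.learning_rates)
        self.rng = random.Random(seed)
        self.last_arm = None
        self.last_probs = None

    def probabilities(self):
        # Exponential weights on the estimated cumulative losses.
        # Subtracting the minimum keeps math.exp numerically stable.
        base = min(self.cum_loss_est)
        weights = [math.exp(-self.eta * (est - base)) for est in self.cum_loss_est]
        total = sum(weights)
        return [w / total for w in weights]

    def select(self):
        # Sample an arm from the current distribution and remember it.
        probs = self.probabilities()
        arm = self.rng.choices(range(len(probs)), weights=probs)[0]
        self.last_arm, self.last_probs = arm, probs
        return self.learning_rates[arm]

    def update(self, loss):
        # Assumes loss is in [0, 1]. Only the pulled arm is updated, with an
        # importance-weighted (unbiased) estimate of its loss.
        arm = self.last_arm
        self.cum_loss_est[arm] += loss / self.last_probs[arm]


selector = Exp3LRSelector([1e-3, 1e-2, 1e-1])
for _ in range(200):
    lr = selector.select()
    loss = 0.1 if lr == 1e-2 else 0.9  # made-up feedback for illustration
    selector.update(loss)
```

In a real training loop the feedback would be some bounded function of the observed training or validation loss; how to scale it is a design choice this sketch leaves open.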

UCB: Upper Confidence Bound algorithm

intro: a classic algorithm for the stochastic multi-armed bandit problem that achieves the optimal regret bound while being parameter-free, implemented here as a learning rate scheduler.
a nice blog post: https://parameterfree.com/2019/11/21/multi-armed-bandit-iv-ucb/
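
As a rough illustration, the sketch below applies a UCB1-style rule to a set of candidate learning rates. It is a sketch under stated assumptions rather than the scheduler in this repository: the class name `UCBLRSelector` and its methods are hypothetical, the feedback is assumed to be a loss in [0, 1], and the confidence width `sqrt(2 ln t / n)` is the textbook choice. Because the feedback is a loss, the rule picks the arm with the smallest lower confidence bound, which mirrors the usual upper bound on rewards.

```python
import math


class UCBLRSelector:
    """Hypothetical helper: treats each candidate learning rate as a bandit arm."""

    def __init__(self, learning_rates):
        self.learning_rates = list(learning_rates)
        self.counts = [0] * len(self.learning_rates)       # pulls per arm
        self.mean_loss = [0.0] * len(self.learning_rates)  # running mean loss per arm
        self.t = 0
        self.last_arm = None

    def select(self):
        self.t += 1
        # Pull every arm once before using confidence bounds.
        for arm, count in enumerate(self.counts):
            if count == 0:
                self.last_arm = arm
                return self.learning_rates[arm]

        def lower_bound(arm):
            bonus = math.sqrt(2.0 * math.log(self.t) / self.counts[arm])
            return self.mean_loss[arm] - bonus

        # Optimism under uncertainty: choose the arm that could plausibly be best.
        self.last_arm = min(range(len(self.counts)), key=lower_bound)
        return self.learning_rates[self.last_arm]

    def update(self, loss):
        # Assumes loss is in [0, 1]; incremental update of the running mean.
        arm = self.last_arm
        self.counts[arm] += 1
        self.mean_loss[arm] += (loss - self.mean_loss[arm]) / self.counts[arm]


selector = UCBLRSelector([1e-3, 1e-2, 1e-1])
for _ in range(300):
    lr = selector.select()
    loss = 0.1 if lr == 1e-2 else 0.9  # made-up feedback for illustration
    selector.update(loss)
```

With this made-up feedback, the selector ends up pulling the middle learning rate far more often than the other two, since their confidence bounds stop overlapping after a handful of pulls.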

SGDPF

intro: a toy example that uses gradient descent to tune the learning rate automatically. The name stands for 'SGD + parameter-free'.
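
One common way to tune a learning rate by gradient descent is a hypergradient-style update, sketched below on a one-dimensional quadratic. This is an assumption about the general idea, not a description of how SGDPF is implemented in this repository: the function name `sgd_with_learned_lr` and the parameters `lr0` and `meta_lr` are hypothetical. The update uses the fact that the derivative of the current loss with respect to the previous learning rate is minus the product of the current and previous gradients.

```python
def sgd_with_learned_lr(grad_fn, w0, lr0=0.01, meta_lr=0.001, steps=100):
    """Hypothetical toy: gradient descent on w, plus gradient descent on lr."""
    w, lr = w0, lr0
    prev_grad = 0.0
    for _ in range(steps):
        g = grad_fn(w)
        # d loss(w_t) / d lr = -g_t * g_{t-1}, so a descent step on lr
        # adds meta_lr * g_t * g_{t-1}. Clamp at zero to keep lr valid.
        lr = max(lr + meta_lr * g * prev_grad, 0.0)
        w = w - lr * g
        prev_grad = g
    return w, lr


# Minimize f(w) = (w - 3)^2, whose gradient is 2 * (w - 3).
w_final, lr_final = sgd_with_learned_lr(lambda w: 2.0 * (w - 3.0), w0=0.0)
```

Successive gradients that point the same way increase the learning rate, and gradients that flip sign decrease it. The sketch still needs an initial learning rate and a meta step size, so it is only a toy view of the idea.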
