Fast Local minimA finding with third-order SmootHness (FLASH)

This repository contains pytorch code that produces the local minma finding algorithm in the paper: Third-order Smoothness Helps: Faster Stochastic Optimization Algorithms for Finding Local Minima.

We perform experiments of training a deep autoencoder on MNIST dataset, where the autoencoder is composed of a fully connected encoder with layers of size (28 x 28)-1024-512-256-32 and a symmetric decoder.

Prerequisites:

Python (3.6.4)
Pytorch (0.4.1)
NumPy
CUDA

Command Line Arguments:

--LR-SCSG: learning rate for scsg
--LR-NEG: learning rate for negative curvature descent
--EPOCH: total epoch for the algorithm
--BATCH-SIZE: mini batch size for scsg in training

Usage Examples:

Run experiments on MNIST:

  -  python train_flash.py  --EPOCH 500

Reference

Third-order Smoothness Helps: Faster Stochastic Optimization Algorithms for Finding Local Minima. Yaodong Yu*, Pan Xu* and Quanquan Gu, (*: equal contribution). NeurIPS-2018.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
model.py		model.py
ncd_step.py		ncd_step.py
scsg_step.py		scsg_step.py
train_flash.py		train_flash.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Fast Local minimA finding with third-order SmootHness (FLASH)

Prerequisites:

Command Line Arguments:

Usage Examples:

Reference

About

Releases

Packages

Contributors 2

Languages

License

uclaml/FLASH

Folders and files

Latest commit

History

Repository files navigation

Fast Local minimA finding with third-order SmootHness (FLASH)

Prerequisites:

Command Line Arguments:

Usage Examples:

Reference

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages