The “Learned Optimizer List” is an ordered list of optimizer hyperparameter configurations selected for strong performance across a large set of machine learning tasks.
Instead of designing a hyperparameter search space for your problem or running a complex hyperparameter search, we suggest simply trying configurations from this list.
Wondering how this list was made? See our paper, "Using a thousand optimization tasks to learn hyperparameter search strategies", and the accompanying task_set code.
Contact Luke Metz ([email protected]) for questions or issues.
We provide the hyperparameter list as a drop-in optimizer replacement for your favorite machine learning framework: TensorFlow, PyTorch, or JAX.
To use the list of hyperparameters, first decide how many training steps you want to train your model for,
then try multiple configurations from the list in order (the idx argument).
The list is sorted, so a small number of trials (e.g. under 10) should already give good results; for best performance, keep trying values up to 100 (see the sketch below).
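For concreteness, here is a minimal PyTorch sketch of such a sweep. The tiny regression model, the synthetic data, and the choice of sweeping the first 10 indices are placeholders for illustration; the opt_list calls mirror the PyTorch example further down.

import torch
from opt_list import torch_opt_list

# Placeholder problem: a tiny linear regression task.
def build_model():
  return torch.nn.Linear(8, 1)

inputs = torch.randn(256, 8)
targets = torch.randn(256, 1)
training_steps = 1000

best_loss, best_idx = float("inf"), None
# The list is sorted, so try configurations in order starting from idx=0.
for idx in range(10):
  model = build_model()
  opt = torch_opt_list.optimizer_for_idx(model.parameters(), idx=idx,
                                         training_steps=training_steps)
  for _ in range(training_steps):
    loss = torch.nn.functional.mse_loss(model(inputs), targets)
    opt.zero_grad()
    loss.backward()
    opt.step()
  if loss.item() < best_loss:
    best_loss, best_idx = loss.item(), idx
print("best idx:", best_idx, "final loss:", best_loss)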
Try it out on your problem and let us know where it works, and where it doesn't!
We provide full example usage in opt_list/examples/ and code snippets below. For all examples we assume you are in the google_research/opt_list directory.
For best performance, we recommend turning off any form of L2 regularization or weight decay; the selected configurations already manage this for you (a short illustration follows).
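As a sketch of what this means in practice (hypothetical model and data names), keep the training loss to the raw task objective and let the chosen configuration handle weight decay:

import torch.nn.functional as F

def loss_fn(model, inputs, targets):
  # Raw task loss only: no added "+ l2_coeff * sum of squared weights" term
  # and no extra weight_decay setting; the selected configurations already
  # apply tuned AdamW-style weight decay.
  return F.cross_entropy(model(inputs), targets)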
Install via pip from GitHub:
pip install git+https://github.com/google-research/google-research.git#subdirectory=opt_list
Alternatively, clone the repository and install the module from there.
git clone https://github.com/google-research/google-research.git
cd google-research/opt_list/
pip install -e .
Full example: python3 -m opt_list.examples.torch
from opt_list import torch_opt_list
opt = torch_opt_list.optimizer_for_idx(model.parameters(), idx=0,
                                       training_steps=training_steps)
for i in range(training_steps):
  loss = forward()
  opt.zero_grad()
  loss.backward()
  opt.step()
Full example: python3 -m opt_list.examples.tf_v1
from opt_list import tf_opt_list
global_step = tf.train.get_or_create_global_step()
opt = tf_opt_list.optimizer_for_idx(0, training_steps, iteration=global_step)
train_op = opt.minimize(loss)
with tf.Session() as sess:
  for i in range(training_steps):
    sess.run([train_op])
Full example: python3 -m opt_list.examples.tf_keras
from opt_list import tf_opt_list
opt = tf_opt_list.keras_optimizer_for_idx(0, training_steps)
model.compile(loss='mse', optimizer=opt, metrics=[])
for i in range(training_steps):
  model.train_on_batch(inp, target)
Full example: python3 -m opt_list.examples.jax_flax
from opt_list import jax_flax_opt_list
optimizer_def = jax_flax_opt_list.optimizer_for_idx(0, training_steps)
optimizer = optimizer_def.create(model)
for i in range(training_steps):
  optimizer, loss = optimizer.optimize(loss_fn)
Full example: python3 -m opt_list.examples.jax_optimizers
import jax
from opt_list import jax_optimizers_opt_list
opt_init, opt_update, get_params = jax_optimizers_opt_list.optimizer_for_idx(
    0, training_steps)
opt_state = opt_init(params)
for i in range(training_steps):
  params = get_params(opt_state)
  opt_state = opt_update(i, jax.grad(loss_fn)(params, batch), opt_state)
Full example: python3 -m opt_list.examples.jax_optix
For now, optix doesn't support AdamW-style weight decay. As such, this will NOT be a drop-in replacement, but it follows a similar API.
import jax
from jax.experimental import optix
from opt_list import jax_optix_opt_list
opt = jax_optix_opt_list.optimizer_for_idx(0, training_steps)
opt_state = opt.init(params)
for i in range(training_steps):
  grads = jax.grad(loss_fn)(params, batch)
  # Not opt.update! We need parameter values too!
  updates, opt_state = opt.update_with_params(grads, params, opt_state)
  params = optix.apply_updates(params, updates)