Skip to content

Stochastic Quasi-Newton Methods in a Trust Region Framework (MATLAB implementation)

Notifications You must be signed in to change notification settings

manuel-suarez/sL_QN_TR

 
 

Repository files navigation

blue-lines-png-7

sL_QN_TR

Stochastic Quasi-Newton Methods in a Trust Region Framework (MATLAB Implementation)

The repository "sL_QN_TR" contains programs of s-LBFGS-TR and s-LSR1-TR which are stochastic algorithms based on two well-known quasi-Newton updates, i.e., limited memory BFGS and limited memory SR1, in a Trust Region Framework. These algorithms, their performance to train deep neural networks in image classification and their implementation points of algorithms from creating a network to training are described in the following paper:

On the efficiency of Stochastic Quasi-Newton Methods for Deep Learning

Mahsa Yousefi and Angeles Martinez Calomardo

Read and Download the Paper.

Download the Supplementary Material of the Paper.

A MATLAB-based tutorial on implementing of training loops for a deep neural network

Mahsa Yousefi and Angeles Martinez Calomardo

This tutorial shows you how to define a convolutional neural network (CNN) and how to create customize your own training loops. If you are a MATLAB user who would like to implement their own training algorithm for which the MATLAB built-in function does not exist, read this tutorial.

Read and Download the Tutorial.

blue-lines-png-7 In this repository

  • All programs have been written in MATLAB using the DL toolbox.
  • There are four main programs:
  • Every single program in .mlx format (live script of MATLAB) provides a step-by-step guideline for users.
  • Architectures with and without batch normalization (BN) layers:
    • LeNet-like, ResNet-20 and ConvNet3FC2 without BN layers are included in sL_BFGS_TR_noBN.mlx and sL_SR1_TR_noBN.mlx
    • ResNet-20 and ConvNet3FC2 with BN layers are considered in sL_BFGS_TR.mlx and sL_SR1_TR.mlx.
  • The folder Subroutines includes required functions (.m files) for running programs.
  • The folder Datasets consists three folders associated with standard benchmarks MNIST, Fashion-MNIST and CIFAR10 with their own loading file.
  • The following file provides methods for producing figures to illusterate comparisions between sL_BFGS_TR and sL_SR1_TR:
  • To create figures for the example described in Build Figure.mlx, the values of loss, accuracy, and training time stored in the folder Info are required.
  • For executing, put together main programs with all .m files and required files from Subroutines and Datasets, respectively.

blue-lines-png-7

Please contact me if you have any questions, suggestions, requests or bug-reports.

[email protected]

About

Stochastic Quasi-Newton Methods in a Trust Region Framework (MATLAB implementation)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • MATLAB 100.0%