Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
TrainCompactQuadraticNetworksViaReLinear		TrainCompactQuadraticNetworksViaReLinear
TrainQuadraticNetworksViaReLinear+ReZero		TrainQuadraticNetworksViaReLinear+ReZero
TrainQuadraticNetworksViaReLinear		TrainQuadraticNetworksViaReLinear
Figure_IWL.png		Figure_IWL.png
Figure_guaranteed_improvements.pdf		Figure_guaranteed_improvements.pdf
Figure_guaranteed_improvements.png		Figure_guaranteed_improvements.png
README.md		README.md

Repository files navigation

Expressivity and Trainability of Quadratic Networks

| ArXiv |

This respository includes implementations of the algorithm "ReLInear" proposed in Expressivity and Trainability of Quadratic Networks. The ReLinear encourages the model to learn suitable quadratic terms gradually and adaptively in reference to the corresponding linear terms. The ReLinear method has the following two steps. First, the quadratic weights in each neuron are set to $w^g = 0, b^g = 1$ and $w^b = 0, c = 0$. Such an initialization degenerates a quadratic neuron into a conventional neuron. Second, quadratic terms are regularized in the training process. Intuitively, two ways of regularization: shrinking the gradients of quadratic weights and shrinking quadratic weights.

Figure 1. The performance of a quadratic network trained using the proposed ReLinear method, with an observed improvement than the conventional network of the same structure. $(\gamma_g,\gamma_b)$, $(\alpha_g,\alpha_b)$, and $(\beta_g,\beta_b)$ are hyperparameters of ReLinear. As these hyperparameters increases from 0, the trained model transits from the conventional model to the quadratic, and the model's performance reaches the optimality.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Expressivity and Trainability of Quadratic Networks

About

Releases

Packages

Languages

wdayang/ReLinear

Folders and files

Latest commit

History

Repository files navigation

Expressivity and Trainability of Quadratic Networks

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages