Skip to content

Adaptation of Hybrid methods to softmax sampling. Code for some results and the general hybrid mechanisms for our work in https://openreview.net/pdf?id=EMigfE6ZeS

License

Notifications You must be signed in to change notification settings

arijitthegame/hybrid-sampling

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

74 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Code to run hybrid sampling schemes to compare against the softmax sampling. The code can be used to replicate the results in our ICLR paper https://openreview.net/pdf?id=EMigfE6ZeS.

All experiments are run on langauge modeling tasks. The datasets used are Penn Tree Bank, Wikitext 2 and Wikitext 103. We will just use a LSTM/biLSTM for all the experiments. The goal here is to prove the efficacy of these hybrid schemes and not the model itself. The code supports some experiments in our work "Hybrid Random features" https://arxiv.org/pdf/2110.04367.pdf
Added: sampled softmax. This is a simple reimplementation of https://www.tensorflow.org/api_docs/python/tf/nn/sampled_softmax_loss
Added weight tying as in https://arxiv.org/pdf/1608.05859v3.pdf
Added all the kernel methods except the clustering estimators.

The code needs cleanup.

Added Alias sampling methods. https://lips.cs.princeton.edu/the-alias-method-efficient-sampling-with-many-discrete-outcomes/

Implementation of https://papers.nips.cc/paper/2019/file/e43739bba7cdb577e9e3e4e42447f5a5-Paper.pdf

About

Adaptation of Hybrid methods to softmax sampling. Code for some results and the general hybrid mechanisms for our work in https://openreview.net/pdf?id=EMigfE6ZeS

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published