This is a companion repository on two-tower models, a commonly used approach to retrieval / candidate generation in recommender systems. The goal of this repository is to show how to increase the alignment of retrieval with ranking.
This shows a sample implementation of two-tower models in PyTorch.
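As a rough sketch of the approach, a two-tower model maps user and item features through separate towers into a shared embedding space, and scores candidates with a dot product. The class and layer sizes below are illustrative, not the repository's actual code:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TwoTowerModel(nn.Module):
    """Minimal two-tower retrieval model: separate MLP towers embed user and
    item features into a shared space; relevance is their inner product."""

    def __init__(self, user_dim: int, item_dim: int, embed_dim: int = 32):
        super().__init__()
        self.user_tower = nn.Sequential(
            nn.Linear(user_dim, 64), nn.ReLU(), nn.Linear(64, embed_dim)
        )
        self.item_tower = nn.Sequential(
            nn.Linear(item_dim, 64), nn.ReLU(), nn.Linear(64, embed_dim)
        )

    def forward(self, user_feats: torch.Tensor, item_feats: torch.Tensor) -> torch.Tensor:
        # Normalising both sides turns the dot product into a cosine-style score.
        u = F.normalize(self.user_tower(user_feats), dim=-1)
        v = F.normalize(self.item_tower(item_feats), dim=-1)
        return (u * v).sum(dim=-1)  # one score per (user, item) pair
```

Because the two towers only interact through the final dot product, item embeddings can be precomputed and indexed, which is what makes this architecture suitable for large-scale candidate generation.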
This is a textbook implementation of self-attention and positional encodings to summarise user history for the user tower.
This adds the UserHistoryEncoder above to the user tower.
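To illustrate the two pieces above, here is a textbook-style sketch: sinusoidal positional encodings plus a single self-attention layer summarise the history sequence, and the summary is concatenated with the user's other features inside the tower. Class names, pooling choice, and dimensions are illustrative assumptions, not the repository's actual code:

```python
import math
import torch
import torch.nn as nn

class UserHistoryEncoder(nn.Module):
    """Summarises a (batch, seq_len, embed_dim) history of item embeddings via
    sinusoidal positional encodings + one self-attention layer + mean pooling."""

    def __init__(self, embed_dim: int, max_len: int = 128, num_heads: int = 2):
        super().__init__()
        # Textbook sinusoidal positional encodings, precomputed up to max_len.
        pos = torch.arange(max_len).unsqueeze(1)
        div = torch.exp(torch.arange(0, embed_dim, 2) * (-math.log(10000.0) / embed_dim))
        pe = torch.zeros(max_len, embed_dim)
        pe[:, 0::2] = torch.sin(pos * div)
        pe[:, 1::2] = torch.cos(pos * div)
        self.register_buffer("pe", pe)
        self.attn = nn.MultiheadAttention(embed_dim, num_heads, batch_first=True)

    def forward(self, history: torch.Tensor) -> torch.Tensor:
        x = history + self.pe[: history.size(1)]
        out, _ = self.attn(x, x, x)  # self-attention over the history sequence
        return out.mean(dim=1)       # (batch, embed_dim) summary vector

class UserTowerWithHistory(nn.Module):
    """User tower that concatenates the history summary with other user features."""

    def __init__(self, user_dim: int, embed_dim: int = 32):
        super().__init__()
        self.history_encoder = UserHistoryEncoder(embed_dim)
        self.mlp = nn.Sequential(
            nn.Linear(user_dim + embed_dim, 64), nn.ReLU(), nn.Linear(64, embed_dim)
        )

    def forward(self, user_feats: torch.Tensor, history: torch.Tensor) -> torch.Tensor:
        summary = self.history_encoder(history)
        return self.mlp(torch.cat([user_feats, summary], dim=-1))
```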
Since we are using net user value to estimate relevance, this uses a weighting approach to position debiasing. There are other approaches as well, such as logit debiasing via a shallow tower, that we recommend trying out.
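One common form of the weighting approach is inverse-propensity weighting: each training example's loss is scaled by the inverse of the estimated probability that its display position was examined, so engagement at deep positions counts more. The function below is a minimal sketch under that assumption; the propensity table is assumed to be estimated offline and is not part of the repository:

```python
import torch
import torch.nn.functional as F

def position_debiased_loss(
    scores: torch.Tensor,      # raw model logits, shape (batch,)
    labels: torch.Tensor,      # engagement labels in {0, 1}, shape (batch,)
    positions: torch.Tensor,   # display position of each example, shape (batch,)
    propensity: torch.Tensor,  # per-position examination probabilities (assumed given)
) -> torch.Tensor:
    """Binary cross-entropy where each example is up-weighted by the inverse of
    its position's examination propensity, a simple weighting-style debias."""
    weights = 1.0 / propensity[positions]  # deep (rarely examined) slots weigh more
    return F.binary_cross_entropy_with_logits(
        scores, labels, weight=weights, reduction="mean"
    )
```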
This extends TwoTowerWithPositionDebiasedWeights from above and implements a pointwise ranking module, broadly as described in Revisiting neural accelerators. We chose this because implementing a light ranker in the retrieval system is an intuitive way to increase consistency with the main ranking.
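The idea can be sketched as a small MLP head that scores each retrieved candidate from richer interaction features than a plain dot product allows, which is what lets it mimic the main ranker more closely. The feature choice (concatenation plus elementwise product) and class name below are illustrative assumptions:

```python
import torch
import torch.nn as nn

class PointwiseRankingHead(nn.Module):
    """Light pointwise ranker over two-tower embeddings: instead of a pure dot
    product, score each candidate with a small MLP over
    [user_emb, item_emb, user_emb * item_emb]."""

    def __init__(self, embed_dim: int):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(3 * embed_dim, 32), nn.ReLU(), nn.Linear(32, 1)
        )

    def forward(self, user_emb: torch.Tensor, item_emb: torch.Tensor) -> torch.Tensor:
        # The elementwise product reintroduces explicit feature interactions
        # that the factorised two-tower score cannot express on its own.
        x = torch.cat([user_emb, item_emb, user_emb * item_emb], dim=-1)
        return self.mlp(x).squeeze(-1)  # one score per (user, item) pair
```

In practice such a head would only be applied to the shortlist produced by the dot-product retrieval, keeping the extra compute proportional to the candidate set rather than the full corpus.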
Then we extend this to add knowledge distillation from ranking models. This is similar to the approach described in How to reduce the cost of ranking by knowledge distillation.
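A typical distillation objective blends the usual hard-label loss with a soft term that pulls the retrieval (student) scores toward the ranking model's (teacher) scores. The blending weight and temperature below are illustrative hyperparameters, not values from the repository:

```python
import torch
import torch.nn.functional as F

def distillation_loss(
    student_logits: torch.Tensor,  # retrieval model scores
    teacher_logits: torch.Tensor,  # frozen ranking model scores
    labels: torch.Tensor,          # engagement labels in {0, 1}
    alpha: float = 0.5,            # weight on the hard-label term (assumed)
    temperature: float = 2.0,      # softens teacher targets (assumed)
) -> torch.Tensor:
    """Hard-label BCE plus a soft KD term against the teacher's scaled scores."""
    hard = F.binary_cross_entropy_with_logits(student_logits, labels)
    soft_targets = torch.sigmoid(teacher_logits / temperature)
    soft = F.binary_cross_entropy_with_logits(student_logits / temperature, soft_targets)
    return alpha * hard + (1.0 - alpha) * soft
```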
In this step we add a further layer of funnel consistency: inspired by RLHF, we use the ranking model as a "reward model" and learn to make retrieval more aligned with it.
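One way to realise this, sketched under the assumption of a fixed candidate set per user, is a REINFORCE-style objective: treat the retrieval scores as a policy over candidates and use the frozen ranker's scores as rewards, with a mean-reward baseline to reduce variance. This is an illustrative formulation, not necessarily the exact objective used in the repository:

```python
import torch

def reward_weighted_loss(
    retrieval_logits: torch.Tensor,  # (batch, num_candidates) retrieval scores
    reward_scores: torch.Tensor,     # (batch, num_candidates) frozen ranker scores
) -> torch.Tensor:
    """REINFORCE-style loss: maximise the expected 'reward' (ranking score)
    under the retrieval policy, using a mean-reward baseline per user."""
    log_probs = torch.log_softmax(retrieval_logits, dim=-1)  # policy over candidates
    advantage = reward_scores - reward_scores.mean(dim=-1, keepdim=True)
    # detach() keeps gradients flowing only through the retrieval policy,
    # treating the ranking model purely as a reward signal.
    return -(log_probs * advantage.detach()).sum(dim=-1).mean()
```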
This is a helper file implementing maximum inner product search as a plain matrix multiplication. This gives us a pure-PyTorch implementation to write tests against.
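The brute-force version is a one-liner worth seeing: a single matmul of queries against the full item table, followed by top-k. The function name below is illustrative; this is the kind of exact-but-slow baseline suitable for tests, where production systems would use an approximate nearest-neighbour index:

```python
import torch

def mips_topk(query: torch.Tensor, item_embeddings: torch.Tensor, k: int = 10):
    """Exact maximum inner product search by brute force: score every item
    with one matrix multiplication, then take the top-k per query."""
    scores = query @ item_embeddings.T        # (num_queries, num_items)
    top_scores, top_indices = scores.topk(k, dim=-1)
    return top_scores, top_indices
```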
Disclaimer: These are the personal creations/opinions of the author(s). Any artifacts or opinions stated here are theirs and not representative of their current or prior employer(s). Apart from publicly available information, any other information here is not claimed to refer to any company, including ones the author(s) may have worked at or been associated with.