Stars
3
stars
written in Python
Clear filter
Reinforcement learning with unsupervised auxiliary tasks
Train Llama 2 & 3 on the SQuAD v2 task as an example of how to specialize a generalized (foundation) model.