Stars
4 results for source starred repositories
Train Llama 2 & 3 on the SQuAD v2 task as an example of how to specialize a generalized (foundation) model.
Reinforcement learning with unsupervised auxiliary tasks