🎯
Focusing
This is the moment
-
Peking University
- Beijing
-
03:36
(UTC +08:00) - https://al-377.github.io/
Stars
LLM-tuning
The tools for LLM tuning
3 repositories
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Train transformer language models with reinforcement learning.