Stars
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
MichaelCola / LLM-RLHF-Tuning
Forked from Joyce94/LLM-RLHF-TuningLLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)
The official GitHub page for the survey paper "A Survey of Large Language Models".
Aligning Large Language Models with Human: A Survey
A curated list of reinforcement learning with human feedback resources (continually updated)
Making large AI models cheaper, faster and more accessible
DRLib:a Concise Deep Reinforcement Learning Library, Integrating HER, PER and D2SR for Almost Off-Policy RL Algorithms.
RSTutorials: A Curated List of Must-read Papers on Recommender System.
MichaelCola / algorithm-base
Forked from chefyuan/algorithm-base专门为刚开始刷题的同学准备的算法基地,没有最细只有更细,立志用动画将晦涩难懂的算法说的通俗易懂!
PyTorch implementations of deep reinforcement learning algorithms and environments
MichaelCola / leetcode
Forked from azl397985856/leetcodeLeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解,记录自己的leetcode解题之路。)
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
中国大学MOOC《机器人操作系统入门》课程代码示例
🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥
papers about Face Detection; Face Alignment; Face Recognition && Face Identification && Face Verification && Face Representation; Face Reconstruction; Face Tracking; Face Super-Resolution && Face D…