Skip to content
View MichaelCola's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report MichaelCola

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 1,143 41 Updated Nov 21, 2024
Python 1 Updated Oct 17, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 38,560 4,741 Updated Jan 21, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,312 4,204 Updated Jan 21, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,855 369 Updated Jan 20, 2025

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Python 1 Updated Oct 11, 2023

Inference code for Llama models

Python 57,278 9,663 Updated Aug 18, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 10,851 846 Updated Aug 20, 2024

Aligning Large Language Models with Human: A Survey

712 31 Updated Sep 11, 2023

A curated list of reinforcement learning with human feedback resources (continually updated)

3,632 223 Updated Dec 5, 2024

Making large AI models cheaper, faster and more accessible

Python 39,025 4,352 Updated Jan 21, 2025

ChatGPT资料汇总学习,持续更新......

4,113 385 Updated Nov 30, 2024

DRLib:a Concise Deep Reinforcement Learning Library, Integrating HER, PER and D2SR for Almost Off-Policy RL Algorithms.

Python 531 70 Updated Apr 2, 2024

RSTutorials: A Curated List of Must-read Papers on Recommender System.

6,262 1,354 Updated Aug 21, 2024

专门为刚开始刷题的同学准备的算法基地,没有最细只有更细,立志用动画将晦涩难懂的算法说的通俗易懂!

Java 1 Updated Oct 15, 2021

PyTorch implementations of deep reinforcement learning algorithms and environments

Python 5,689 1,203 Updated Jul 25, 2024

使用深度强化学习,训练避障策略

1 Updated Apr 29, 2021
Python 907 287 Updated Jan 29, 2023

LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解,记录自己的leetcode解题之路。)

JavaScript 1 Updated Dec 21, 2020

此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。

Jupyter Notebook 16,266 4,577 Updated Jun 21, 2022

中国大学MOOC《机器人操作系统入门》课程代码示例

CMake 1 Updated Nov 10, 2020

🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥

Python 3,475 759 Updated Dec 23, 2022

papers about Face Detection; Face Alignment; Face Recognition && Face Identification && Face Verification && Face Representation; Face Reconstruction; Face Tracking; Face Super-Resolution && Face D…

4,561 966 Updated Feb 9, 2023