MichaelCola

Follow

🎯

Focusing

MichaelCola MichaelCola

🎯

Focusing

Follow

3 followers · 1 following

BUAA

Stars

Open-Source-O1 / Open-O1

Python 1,143 41 Updated Nov 21, 2024

MichaelCola / Open-O1

Forked from Open-Source-O1/Open-O1

Python 1 Updated Oct 17, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 38,560 4,741 Updated Jan 21, 2025

microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,312 4,204 Updated Jan 21, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,855 369 Updated Jan 20, 2025

MichaelCola / LLM-RLHF-Tuning

Forked from Joyce94/LLM-RLHF-Tuning

LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)

Python 1 Updated Oct 11, 2023

meta-llama / llama

Inference code for Llama models

Python 57,278 9,663 Updated Aug 18, 2024

RUCAIBox / LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 10,851 846 Updated Aug 20, 2024

GaryYufei / AlignLLMHumanSurvey

Aligning Large Language Models with Human: A Survey

712 31 Updated Sep 11, 2023

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

3,632 223 Updated Dec 5, 2024

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 39,025 4,352 Updated Jan 21, 2025

dalinvip / Awesome-ChatGPT

ChatGPT资料汇总学习，持续更新......

4,113 385 Updated Nov 30, 2024

kaixindelele / DRLib

DRLib：a Concise Deep Reinforcement Learning Library, Integrating HER, PER and D2SR for Almost Off-Policy RL Algorithms.

Python 531 70 Updated Apr 2, 2024

hongleizhang / RSPapers

RSTutorials: A Curated List of Must-read Papers on Recommender System.

6,262 1,354 Updated Aug 21, 2024

MichaelCola / algorithm-base

Forked from chefyuan/algorithm-base

专门为刚开始刷题的同学准备的算法基地，没有最细只有更细，立志用动画将晦涩难懂的算法说的通俗易懂！

Java 1 Updated Oct 15, 2021

p-christ / Deep-Reinforcement-Learning-Algorithms-with-PyTorch

PyTorch implementations of deep reinforcement learning algorithms and environments

Python 5,689 1,203 Updated Jul 25, 2024

MichaelCola / drl_obstacle_avoid

使用深度强化学习，训练避障策略

1 Updated Apr 29, 2021

louisnino / RLcode

Python 907 287 Updated Jan 29, 2023

MichaelCola / leetcode

Forked from azl397985856/leetcode

LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解，记录自己的leetcode解题之路。)

JavaScript 1 Updated Dec 21, 2020

NLP-LOVE / ML-NLP

此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现，也是作为一个算法工程师必会的理论基础知识。

Jupyter Notebook 16,266 4,577 Updated Jun 21, 2022

MichaelCola / ROS-Academy-for-Beginners

Forked from DroidAITech/ROS-Academy-for-Beginners

中国大学MOOC《机器人操作系统入门》课程代码示例

CMake 1 Updated Nov 10, 2020

ZhaoJ9014 / face.evoLVe

🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥

Python 3,475 759 Updated Dec 23, 2022

ChanChiChoi / awesome-Face_Recognition

papers about Face Detection; Face Alignment; Face Recognition && Face Identification && Face Verification && Face Representation; Face Reconstruction; Face Tracking; Face Super-Resolution && Face D…

4,561 966 Updated Feb 9, 2023