Easy-to-read implementations of deep RL algorithms in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Python 329 39 Updated Mar 25, 2023

allenai / FineGrainedRLHF

Python 269 22 Updated Jan 6, 2025

efeslab / fiddler

[ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration

Python 197 18 Updated Nov 18, 2024

dhruvramani / Transformers-RL

An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"

Python 179 23 Updated Feb 21, 2023

IntelLabs / matsciml

Open MatSci ML Toolkit is a framework for prototyping and scaling out deep learning models for materials discovery supporting widely used materials science datasets, and built on top of PyTorch Lig…

Python 169 26 Updated Mar 4, 2025

openlifescience-ai / Open-Medical-Reasoning-Tasks

A comprehensive repository of reasoning tasks for Medical LLMs (and beyond)

Python 113 10 Updated Sep 16, 2024

ezliu / dream

Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning

Python 90 14 Updated Feb 13, 2023

lucaslingle / pytorch_rl2

Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'

Python 58 10 Updated Jan 1, 2022

cs182sp21 / hw4_student

Python 3 22 Updated Mar 28, 2021

Young Ko YoungKo

Lists (2)

LLM

RL

Stars

python / cpython

microsoft / JARVIS

huggingface / peft

jadore801120 / attention-is-all-you-need-pytorch

LargeWorldModel / LWM

google-deepmind / graphcast

seungeunrho / minimalRL

ridgerchu / matmulfreellm

noahshinn / reflexion

spcl / graph-of-thoughts

lucidrains / self-rewarding-lm-pytorch

tristandeleu / pytorch-maml-rl

sail-sg / lorahub

katerakelly / oyster

RITCHIEHuang / DeepRL_Algorithms