Stars
This is the open-source code for Cumulative Curriculum Reinforcement Learning (CCRL).
This is a concise PyTorch implementation of Rainbow DQN, including Double Q-learning, Dueling network, Noisy network, PER and n-step Q-learning.
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
Concise PyTorch implementations of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.
Implementations of MAPPO and IPPO on SMAC, the multi-agent StarCraft environment.
Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3 and SAC.
A clean and robust implementation of Duel Double DQN
Solve BipedalWalkerHardcore-v2 with TD3