JoneNash / awesome-deep-rl Public

forked from tigerneil/awesome-deep-rl

Notifications You must be signed in to change notification settings
Fork 0
Star 0

This project is for learning and researching on Deep RL. Maintained by University AI researchers.

0 stars 217 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 186 Commits
images		images
ACER.md		ACER.md
BiCNet.md		BiCNet.md
C51-analysis.md		C51-analysis.md
C51.md		C51.md
COMA.md		COMA.md
D4PG.md		D4PG.md
DDPG.md		DDPG.md
DDQN.md		DDQN.md
DEBP.md		DEBP.md
DPPO.md		DPPO.md
DQN.md		DQN.md
DQfD.md		DQfD.md
Distral.md		Distral.md
DualMDP.md		DualMDP.md
Dueling.md		Dueling.md
ECMAC.md		ECMAC.md
EPG.md		EPG.md
EX2.md		EX2.md
GANAC.md		GANAC.md
GVG.md		GVG.md
HIRL.md		HIRL.md
I2As.md		I2As.md
IBP.md		IBP.md
IPG.md		IPG.md
IQN.md		IQN.md
LFOD.md		LFOD.md
LICENSE		LICENSE
LOLA.md		LOLA.md
MADDPG.md		MADDPG.md
MBDQN.md		MBDQN.md
MCAI.md		MCAI.md
MMRB.md		MMRB.md
MSRL.md		MSRL.md
NDM.md		NDM.md
NEC.md		NEC.md
NoisyNet.md		NoisyNet.md
OP-GAIL.md		OP-GAIL.md
PCL.md		PCL.md
PEB.md		PEB.md
PER.md		PER.md
PGQ.md		PGQ.md
PGSQL.md		PGSQL.md
PPO.md		PPO.md
PhiEB.md		PhiEB.md
Programmable.md		Programmable.md
QEnsemble.md		QEnsemble.md
QPROP.md		QPROP.md
QR-DQN.md		QR-DQN.md
REACTOR.md		REACTOR.md
README.md		README.md
RECUR.md		RECUR.md
REETDQN.md		REETDQN.md
RLCRC.md		RLCRC.md
RLP.md		RLP.md
RLTUNER.md		RLTUNER.md
Rainbow.md		Rainbow.md
RoboSumo.md		RoboSumo.md
TRPO.md		TRPO.md
UBE.md		UBE.md
UML.md		UML.md
UNREAL.md		UNREAL.md
VALOR.md		VALOR.md
ZSTG.md		ZSTG.md
content.md		content.md
dmimic.md		dmimic.md
incentivizing.md		incentivizing.md

Repository files navigation

Awesome-deep-reinforcement-learning

Explicitly show the relationships between various techniques of deep reinforcement learning methods. Dedicated for learning and researching on DRL. This project is for learning and researching on DRL. This area is so hot that everyday we can see new ideas happen. I would like to give an explicit landscape for deep rl, one reason is for aquire the better understanding of existing methods and theoretical results, the other is to seek potential developments based on these findings. Any suggestion/improvement is welcomed.

Recommendations and suggestions are welcome.

Value based methods

TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning 8 Mar 2018
DISTRIBUTED PRIORITIZED EXPERIENCE REPLAY 2 Mar 2018
Rainbow: Combining Improvements in Deep Reinforcement Learning 6 Oct 2017
Learning from Demonstrations for Real World Reinforcement Learning 12 Apr 2017
Dueling Network Architecture
Double DQN
Prioritized Experience
Deep Q-Networks

Policy gradient methods

Explorations in DRL

The Uncertainty Bellman Equation and Exploration 15 Sep 2017
Noisy Networks for Exploration 30 Jun 2017 implementation
Count-Based Exploration in Feature Space for Reinforcement Learning 25 Jun 2017
Count-Based Exploration with Neural Density Models 14 Jun 2017
UCB and InfoGain Exploration via Q-Ensembles 11 Jun 2017
Minimax Regret Bounds for Reinforcement Learning 16 Mar 2017
Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models
EX2: Exploration with Exemplar Models for Deep Reinforcement Learning

Actor-Critic methods

Model-based methods

Model-Based Stabilisation of Deep Reinforcement Learning 6 Sep 2018
Learning model-based planning from scratch 19 July 2017

Model-free + Model-based

Imagination-Augmented Agents for Deep Reinforcement Learning 19 July 2017

Option

Variational Option Discovery Algorithms 26 July 2018
A Laplacian Framework for Option Discovery in Reinforcement Learning 16 Jun 2017

Connection with other methods

Connecting value and policy methods

Reward design

Reinforcement Learning with Corrupted Reward Channel 23 May 2017

Unifying

Multi-step Reinforcement Learning: A Unifying Algorithm

Faster DRL

Neural Episodic Control

Apply RL to other domains

TUNING RECURRENT NEURAL NETWORKS WITH REINFORCEMENT LEARNING

Multiagent Settings

New design

Multitask

Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning 7 Nov 2017
Distral: Robust Multitask Reinforcement Learning 13 July 2017

Observational Learning

Observational Learning by Reinforcement Learning 20 Jun 2017

Meta Learning

Unsupervised Meta-Learning for Reinforcement Learning 12 Jun 2018

Distributional

Implicit Quantile Networks for Distributional Reinforcement Learning 14 Jun 2018
DISTRIBUTED DISTRIBUTIONAL DETERMINISTIC POLICY GRADIENTS 23 Apr 2018
An Analysis of Categorical Distributional Reinforcement Learning 22 Feb 2018
Distributional Reinforcement Learning with Quantile Regression 27 Oct 2017
A Distributional Perspective on Reinforcement Learning 21 July 2017

Inverse RL

ADDRESSING SAMPLE INEFFICIENCY AND REWARD BIAS IN INVERSE REINFORCEMENT LEARNING 9 Sep 2018

Time

Time Limits in Reinforcement Learning

Applications

DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills 9 Apr 2018

About

This project is for learning and researching on Deep RL. Maintained by University AI researchers.

Report repository

Releases

No releases published

Packages

No packages published