Skip to content

Latest commit

 

History

History
161 lines (125 loc) · 13.2 KB

README_bibliography.md

File metadata and controls

161 lines (125 loc) · 13.2 KB

README FILE
Author: Jianyuan (Jet) Yu
Affiliation: Wireless, ECE, Virginia Tech
Email : [email protected]
Date : April, 2018

Bibliography sum up of the Deep Reinforcement Learning on Dynamic Channel Access Project.

Related Files

General Survey

Classic Method

myopic

Whittle

MDP

Recent peer's work


After July 2018

DQN Family & Dr. Silver work

Sutton TextBook

  • Sutton, Richard S., Andrew G. Barto, and Francis Bach. Reinforcement learning: An introduction. MIT press, 1998.
  • BibTex

DQN Family

Method Author Afflicate comment Bibtex paper abbreviation openSource
DQN Mnih Google DeepMind - BibTex paper [mnih2015human] DQN
Double DQN Van Hasselt Google DeepMind - BibTex paper [van2016deep] Double DQN
Prioritized DQN Tom Schaul Google DeepMind - BibTex paper [schaul2015prioritized] Pri DQN
Dueling DQN Wang, Ziyu Google DeepMind - BibTex paper [wang2015dueling] Duel DQN
Asynchronous DQN Mnih Google DeepMind Asynchronous Advantage Actor Critic (A3C) + RNN with continuous action space BibTex paper mnih2016asynchronous] Asyn DQN
Distributional DQN Marc G. Bellemare Google DeepMind - BibTex paper [wang2015dueling]
Noisy Nets DQL Meire Fortunato Google DeepMind - BibTex paper [wang2015dueling]
Rainbow DQN Matteo Hessel Google DeepMind - BibTex paper [hessel2017rainbow]
Deep Deterministic Policy Gradient (DDPG) David Silver Google DeepMind - BibTex paper [silver2014deterministic] DDPG
Distributed Proximal Policy Optimization (DPPO) John Schulman OpenAI - BibTex paper [schulman2017proximal] DDPO

Large Action Size

Dulac-Arnold, Gabriel, et al. "Deep reinforcement learning in large discrete action spaces." arXiv preprint arXiv:1512.07679 (2015).

  • Wolpertinger architecture(similiar to actor-critic)
  • deal with large action-space, ~1 M action

POMDP

How to cite a Github open source

  1. morvan's github for DQN famlily
@misc{Mofan2013,
  author = {Mofan Zhou},
  title = {Reinforcement-learning-with-tensorflow},
  year = {2016},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow}},
  commit = {81fea33905c7f81719ec031eab51c68225eb7cce}
}