-
pytorch-maddpg Public
Forked from xuehy/pytorch-maddpgA pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)
Python UpdatedJun 5, 2018 -
RLSeq2Seq Public
Forked from yaserkl/RLSeq2SeqDeep Reinforcement Learning For Sequence to Sequence Models
Python MIT License UpdatedMay 29, 2018 -
DA-RNN Public
Forked from Zhenye-Na/DA-RNNImplementation of DA-RNN (arXiv:1704.02971) 🚧
Python UpdatedMay 23, 2018 -
-
TD3 Public
Forked from sfujim/TD3PyTorch implementation of TD3 and DDPG for OpenAI gym tasks
Python UpdatedMay 18, 2018 -
tensorflow-lstm-regression Public
Forked from polyaxon/hauptSequence prediction using recurrent neural networks(LSTM) with TensorFlow
Jupyter Notebook MIT License UpdatedMay 16, 2018 -
QueryReformulator Public
Forked from nyu-dl/dl4ir-query-reformulatorPython BSD 3-Clause "New" or "Revised" License UpdatedMay 15, 2018 -
StarSpace Public
Forked from facebookresearch/StarSpaceLearning embeddings for classification, retrieval and ranking.
C++ Other UpdatedMay 4, 2018 -
-
JointNRE Public
Forked from thunlp/JointNREJoint Neural Relation Extraction with Text and KGs
Python MIT License UpdatedApr 13, 2018 -
AlphaZero_ChineseChess Public
Forked from TDteach/AlphaZero_ChineseChessPython UpdatedApr 10, 2018 -
irl-imitation Public
Forked from yrlu/irl-imitationImplementation of Inverse Reinforcement Learning (IRL) algorithms in python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL
Python UpdatedApr 2, 2018 -
models Public
Forked from tensorflow/modelsModels and examples built with TensorFlow
Python Apache License 2.0 UpdatedMar 30, 2018 -
awd-lstm-lm Public
Forked from salesforce/awd-lstm-lmPython BSD 3-Clause "New" or "Revised" License UpdatedMar 29, 2018 -
Reinforcement-learning-with-tensorflow Public
Forked from MorvanZhou/Reinforcement-learning-with-tensorflowSimple Reinforcement learning tutorials
Python MIT License UpdatedMar 29, 2018 -
pytorch-a2c-ppo-acktr Public
Forked from ikostrikov/pytorch-a2c-ppo-acktr-gailPyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (A…
Python MIT License UpdatedMar 28, 2018 -
gym Public
Forked from openai/gymA toolkit for developing and comparing reinforcement learning algorithms.
Python Other UpdatedMar 28, 2018 -
reversi-alpha-zero Public
Forked from mokemokechicken/reversi-alpha-zeroReversi reinforcement learning by AlphaGo Zero methods.
Python MIT License UpdatedMar 27, 2018 -
examples Public
Forked from pytorch/examplesA set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Python BSD 3-Clause "New" or "Revised" License UpdatedMar 26, 2018 -
ENAS-pytorch Public
Forked from carpedm20/ENAS-pytorchPyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"
Python Apache License 2.0 UpdatedMar 26, 2018 -
image_captioning Public
Forked from DeepRNN/image_captioningTensorflow implementation of "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention"
Python MIT License UpdatedMar 22, 2018 -
LeakGAN Public
Forked from CR-Gjx/LeakGANThe codes of paper "Long Text Generation via Adversarial Training with Leaked Information" on AAAI 2018. Text generation using GAN and Hierarchical Reinforcement Learning.
Python UpdatedMar 17, 2018 -
reinforcement-learning Public
Forked from dennybritz/reinforcement-learningImplementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Jupyter Notebook MIT License UpdatedMar 8, 2018 -
seq2seq-signal-prediction Public
Forked from guillaume-chevalier/seq2seq-signal-predictionSignal prediction with a Sequence-to-Sequence (seq2seq) Recurrent Neural Network (RNN) model in TensorFlow - Guillaume Chevalier
Jupyter Notebook MIT License UpdatedMar 6, 2018 -
tensorflow-GANs Public
Forked from TwistedW/tensorflow-GANs各类GAN综合在一起,借鉴了hwalsuklee大神的
Python UpdatedMar 5, 2018 -
chess-alpha-zero Public
Forked from Zeta36/chess-alpha-zeroChess reinforcement learning by AlphaGo Zero methods.
Jupyter Notebook MIT License UpdatedMar 5, 2018 -
intentMARL Public
Forked from SiyuanQi-zz/intentMARLCode for ICRA2018 - Intent-aware Multi-agent Reinforcement Learning.
Python MIT License UpdatedFeb 22, 2018 -
show-attend-and-tell Public
Forked from yunjey/show-attend-and-tellTensorFlow Implementation of "Show, Attend and Tell"
Jupyter Notebook MIT License UpdatedFeb 9, 2018 -
DeepLearningNotes Public
Forked from AlphaSmartDog/DeepLearningNotes机器学习和量化分析学习进行中
Jupyter Notebook MIT License UpdatedFeb 3, 2018 -
rl-teacher Public
Forked from nottombrown/rl-teacherCode for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback
Python MIT License UpdatedDec 26, 2017