Stars
Code for "Scalable lifelong reinforcement learning"
Lifelong reinforcement learning code. Python implementation of the SR-LLRL algorithm, proposed in our 2021 IEEE SMC conference paper "Accelerating lifelong reinforcement learning via reshaping re…
Code for "LifeLong Incremental Reinforcement Learning (LLIRL)"
This repository contains PyTorch implementations of most classic deep reinforcement learning algorithms, including DQN, DDQN, Dueling Network, DDPG, SAC, A2C, PPO, TRPO. (More algorithms are…
A Survey on Explainable Reinforcement Learning: Concepts, Algorithms, Challenges
The code in this repository is my own work, used in my thesis. It is Python code that prints flight times, set-up times, and survival probabilities for the Weapon Target Assignment Problem.
Mean-Shifted Contrastive Loss for Anomaly Detection (AAAI 2023)
A professional list of Tutorials and Surveys on DL, ML, DM, CV, NLP, Speech in top AI conferences and journals.
MultiEMO: An Attention-Based Correlation-Aware Multimodal Fusion Framework for Emotion Recognition in Conversations (ACL 2023)
This repository contains implementations and illustrative code to accompany DeepMind publications
Using Keras and Deep Deterministic Policy Gradient to play TORCS
Run, compile and execute JavaScript for Scientific Computing and Data Visualization TOTALLY TOTALLY TOTALLY in your BROWSER! An open source scientific computing environment for JavaScript TOTALLY i…
Decentralized Autonomous Regulated Company (DARC), a company virtual machine that runs on any EVM-compatible blockchain, with on-chain law system, multi-level tokens and dividends mechanism.
A Chinese-language guide to prompting ChatGPT. Usage guides for a variety of scenarios. Learn how to get it to do what you ask.
This repo includes a curated collection of ChatGPT prompts for using ChatGPT and other LLM tools better.
This repository is to develop novel AIs for complex C2 decision making. It consists of parallel branches for GUI and for AI development (coming soon) for the ARL Battlespace wargame.
A Warmachine (MKIV) tabletop wargaming companion and nemesis application.
This game trains an AI to play dice poker, a deceptively simple game that features incomplete information and bluffing.
The code for the paper 'Temporal Induced Self-Play for Stochastic Bayesian Games' published in IJCAI21.
A study on cooperative games with incomplete information based on the game of Hanabi
Access dict values as attributes (works recursively)
This project ports the MXNet code in the original book "Dive into Deep Learning" (《动手学深度学习》) to PyTorch.