Microsoft
Shanghai, China
CleanDiffuser Public
Forked from CleanDiffuserTeam/CleanDiffuser
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
Python Apache License 2.0 Updated Jul 1, 2024
IsaacLab Public
Forked from isaac-sim/IsaacLab
Unified framework for robot learning built on NVIDIA Isaac Sim
Python Other Updated Jun 19, 2024
unitree_mujoco Public
Forked from unitreerobotics/unitree_mujoco
C++ BSD 3-Clause "New" or "Revised" License Updated Jun 17, 2024
PyTorch-ResNet-CIFAR10 Public
Forked from mtancak/PyTorch-ResNet-CIFAR10
Simple ResNet PyTorch project
Python Updated Nov 3, 2022
Programs for Sutton's book "Reinforcement Learning: An Introduction"
Jupyter Notebook Updated Jul 21, 2022
MajsoulAI Public
Forked from housq/MajsoulAI
Plays online Mahjong Soul (雀魂) matches using JianYangAI as the backend
Python MIT License Updated Mar 31, 2022
awesome-game-ai Public
Forked from datamllab/awesome-game-ai
Awesome Game AI materials of Multi-Agent Reinforcement Learning
MIT License Updated Mar 25, 2022
vlog Public
Variational oracle guiding for reinforcement learning
rlcard Public
Forked from datamllab/rlcard
Reinforcement Learning / AI Bots in Card (Poker) Games - Blackjack, Leduc, Texas, DouDizhu, Mahjong, UNO.
Python MIT License Updated Jul 7, 2021
mahjong Public
Forked from MahjongRepository/mahjong
Implementation of riichi mahjong related stuff (hand cost, shanten, agari end, etc.)
Python MIT License Updated Jan 27, 2021
HetFFN- Public
Codes for paper "Lamina-specific neuronal properties promote robust, stable signal propagation in feedforward networks", NeurIPS 2020
dads Public
Forked from google-research/dads
Code for 'Dynamics-Aware Unsupervised Discovery of Skills' (DADS). Enables skill discovery without supervision, which can be combined with model-based control.
Python Apache License 2.0 Updated Apr 28, 2020
gym-miniworld Public
Forked from Farama-Foundation/Miniworld
Simple 3D interior simulator for RL & robotics research
Python Apache License 2.0 Updated Mar 10, 2020
VIREL Public
Forked from AnujMahajanOxf/VIREL
Code for VIREL: A Variational Inference Framework for Reinforcement Learning
Python Updated Dec 1, 2019
slac Public
Implementation of Stochastic Latent Actor-Critic (SLAC, https://alexlee-gk.github.io/slac/) in PyTorch
mahjong-helper Public
Forked from EndlessCheng/mahjong-helper
Japanese mahjong assistant: tile efficiency, defense, and tile tracking (supports Mahjong Soul and Tenhou)
Go MIT License Updated Aug 6, 2019
tensor2tensor Public
Forked from tensorflow/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Python Apache License 2.0 Updated Jun 20, 2019
tensorflow Public
Forked from tensorflow/tensorflow
Computation using data flow graphs for scalable machine learning
C++ Apache License 2.0 Updated Feb 6, 2018
ECEI-tools Public
Data-processing toolbox for Electron Cyclotron Emission Imaging
MATLAB Updated Aug 17, 2017
euterpe Public
Forked from tachi-hi/euterpe
Real-time Audio-to-audio Karaoke Generation System for Monaural Music
C++ Updated Mar 27, 2017