dementrock

Rocky Duan dementrock

302 followers · 56 following

http://www.rockyduan.com

Achievements

Organizations

Stars

skypilot-org / skypilot

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python 8,128 653 Updated May 23, 2025

eg4000 / SKU110K_CVPR19

Python 801 184 Updated Mar 24, 2023

justinjfu / doodad

A job launching library for docker, EC2, GCP, etc.

Python 57 39 Updated Aug 10, 2021

tensorpack / tensorpack

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

Python 6,306 1,802 Updated Aug 6, 2023

anishathalye / obfuscated-gradients

Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples

Jupyter Notebook 891 172 Updated Jun 10, 2023

google-deepmind / pysc2

StarCraft II Learning Environment

Python 8,126 1,166 Updated Jul 23, 2024

unixpickle / muniverse

µniverse: RL environments for HTML5 games

JavaScript 365 22 Updated Jan 3, 2019

ShibiHe / Q-Optimality-Tightening

This is my implementation of the Optimality Tightening

Python 37 8 Updated Apr 26, 2017

seba-1511 / drl.pth

Implementation of Policy Gradient algorithms in PyTorch. (Sequential, Distributed sync + async)

Python 9 2 Updated Nov 7, 2017

google-deepmind / lab

A customisable 3D platform for agent-based AI research

C 7,226 1,385 Updated Jan 4, 2023

openai / universe-starter-agent

A starter agent that can solve a number of universe environments.

Python 1,098 315 Updated Apr 7, 2018

openai / universe

Universe: a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications.

Python 7,511 946 Updated Apr 5, 2018

jiamings / fast-weights

Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258)

Python 172 22 Updated Nov 3, 2016

openai / vime

Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"

Python 343 92 Updated Nov 22, 2018

openai / InfoGAN

Code for reproducing key results in the paper "InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets"

Python 1,062 304 Updated Mar 25, 2021

baidu-research / persistent-rnn

Fast Recurrent Networks Library

C++ 576 87 Updated Sep 20, 2016

jych / nips2015_vrnn

Python 292 97 Updated Mar 13, 2018

michaelhush / M-LOOP

M-LOOP: Machine-learning online optimization package

Python 163 56 Updated Aug 14, 2024

openai / gym

A toolkit for developing and comparing reinforcement learning algorithms.

Python 35,996 8,674 Updated Oct 11, 2024

rll / rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Python 2,967 800 Updated Jun 10, 2023

ntasfi / PyGame-Learning-Environment

PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.

Python 1,036 231 Updated Jan 19, 2022

casperkaae / parmesan

Variational and semi-supervised neural network toppings for Lasagne

Python 208 31 Updated Aug 25, 2016

cbfinn / gps

Guided Policy Search

Python 599 241 Updated Feb 9, 2021

snipsco / ntm-lasagne

Neural Turing Machines library in Theano with Lasagne

Python 301 51 Updated Jul 31, 2018

lukaszkaiser / NeuralGPU

Code for the Neural GPU

48 1 Updated Mar 15, 2016

dementrock / tensorfuse

Common interface for Theano, CGT, and TensorFlow

Python 237 18 Updated Apr 23, 2016

kjw0612 / awesome-deep-vision

A curated list of deep learning resources for computer vision

10,962 2,780 Updated Aug 15, 2023

janismac / ControlChallenges

JavaScript 131 20 Updated Jun 2, 2024

michalkoziarski / Deep-RL-Zoo

Model Zoo for Deep Reinforcement Learning

14 2 Updated Dec 19, 2015

pybox2d / pybox2d

2D Game Physics for Python

Python 496 93 Updated Nov 29, 2024