Organizations

@rll @rllab

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python 8,128 653 Updated May 23, 2025
Python 801 184 Updated Mar 24, 2023

A job launching library for docker, EC2, GCP, etc.

Python 57 39 Updated Aug 10, 2021

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

Python 6,306 1,802 Updated Aug 6, 2023

Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples

Jupyter Notebook 891 172 Updated Jun 10, 2023

StarCraft II Learning Environment

Python 8,126 1,166 Updated Jul 23, 2024

µniverse: RL environments for HTML5 games

JavaScript 365 22 Updated Jan 3, 2019

This is my implementation of the Optimality Tightening

Python 37 8 Updated Apr 26, 2017

Implementation of Policy Gradient algorithms in PyTorch. (Sequential, Distributed sync + async)

Python 9 2 Updated Nov 7, 2017

A customisable 3D platform for agent-based AI research

C 7,226 1,385 Updated Jan 4, 2023

A starter agent that can solve a number of universe environments.

Python 1,098 315 Updated Apr 7, 2018

Universe: a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications.

Python 7,511 946 Updated Apr 5, 2018

Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258)

Python 172 22 Updated Nov 3, 2016

Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"

Python 343 92 Updated Nov 22, 2018

Code for reproducing key results in the paper "InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets"

Python 1,062 304 Updated Mar 25, 2021

Fast Recurrent Networks Library

C++ 576 87 Updated Sep 20, 2016
Python 292 97 Updated Mar 13, 2018

M-LOOP: Machine-learning online optimization package

Python 163 56 Updated Aug 14, 2024

A toolkit for developing and comparing reinforcement learning algorithms.

Python 35,996 8,674 Updated Oct 11, 2024

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Python 2,967 800 Updated Jun 10, 2023

PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.

Python 1,036 231 Updated Jan 19, 2022

Variational and semi-supervised neural network toppings for Lasagne

Python 208 31 Updated Aug 25, 2016

Guided Policy Search

Python 599 241 Updated Feb 9, 2021

Neural Turing Machines library in Theano with Lasagne

Python 301 51 Updated Jul 31, 2018

Code for the Neural GPU

48 1 Updated Mar 15, 2016

Common interface for Theano, CGT, and TensorFlow

Python 237 18 Updated Apr 23, 2016

A curated list of deep learning resources for computer vision

10,962 2,780 Updated Aug 15, 2023
JavaScript 131 20 Updated Jun 2, 2024

Model Zoo for Deep Reinforcement Learning

14 2 Updated Dec 19, 2015

2D Game Physics for Python

Python 496 93 Updated Nov 29, 2024