Skip to content
View dementrock's full-sized avatar

Organizations

@rll @rllab

Block or report dementrock

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python 7,072 550 Updated Jan 22, 2025
Python 786 182 Updated Mar 24, 2023

A job launching library for docker, EC2, GCP, etc.

Python 57 39 Updated Aug 10, 2021

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

Python 6,302 1,807 Updated Aug 6, 2023

Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples

Jupyter Notebook 885 171 Updated Jun 10, 2023

StarCraft II Learning Environment

Python 8,055 1,158 Updated Jul 23, 2024

µniverse: RL environments for HTML5 games

JavaScript 365 22 Updated Jan 3, 2019

This is my implementation of the Optimality Tightening

Python 37 8 Updated Apr 26, 2017

Implementation of Policy Gradient algorithms in PyTorch. (Sequential, Distributed sync + async)

Python 9 2 Updated Nov 7, 2017

A customisable 3D platform for agent-based AI research

C 7,161 1,369 Updated Jan 4, 2023

A starter agent that can solve a number of universe environments.

Python 1,099 317 Updated Apr 7, 2018

Universe: a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications.

Python 7,492 951 Updated Apr 5, 2018

Implementation of the paper [Using Fast Weights to Attend to the Recent Past](https://arxiv.org/abs/1610.06258)

Python 172 22 Updated Nov 3, 2016

Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"

Python 341 91 Updated Nov 22, 2018

Code for reproducing key results in the paper "InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets"

Python 1,059 307 Updated Mar 25, 2021

Fast Recurrent Networks Library

C++ 580 87 Updated Sep 20, 2016
Python 292 97 Updated Mar 13, 2018

M-LOOP: Machine-learning online optimization package

Python 160 56 Updated Aug 14, 2024

A toolkit for developing and comparing reinforcement learning algorithms.

Python 35,188 8,623 Updated Oct 11, 2024

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Python 2,929 801 Updated Jun 10, 2023

PyGame Learning Environment (PLE) -- Reinforcement Learning Environment in Python.

Python 1,023 231 Updated Jan 19, 2022

Variational and semi-supervised neural network toppings for Lasagne

Python 208 31 Updated Aug 25, 2016

Guided Policy Search

Python 599 241 Updated Feb 9, 2021

Neural Turing Machines library in Theano with Lasagne

Python 300 51 Updated Jul 31, 2018

Code for the Neural GPU

46 1 Updated Mar 15, 2016

Common interface for Theano, CGT, and TensorFlow

Python 236 20 Updated Apr 23, 2016

A curated list of deep learning resources for computer vision

10,872 2,779 Updated Aug 15, 2023
JavaScript 130 18 Updated Jun 2, 2024

Model Zoo for Deep Reinforcement Learning

14 2 Updated Dec 19, 2015

2D Game Physics for Python

Python 485 93 Updated Nov 29, 2024
Next