- Google
- Paris
- https://alexis-jacq.github.io/
Stars
TorchCFM: a Conditional Flow Matching library
Stable diffusion for real-time music generation
A library of reinforcement learning components and agents
Your browser's reference manager: automatic paper detection (Arxiv, OpenReview & more), publication venue matching and code repository discovery! Also enhances ArXiv: BibTex citation, Markdown link…
hill-a / stable-baselines
Forked from openai/baselines. A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Deep Planning Network: Control from pixels by latent planning with learned dynamics
Basic versions of agents from Spinning Up in Deep RL written in PyTorch
RL starter files to immediately train, visualize, and evaluate an agent without writing a single line of code
Simple and easily configurable grid world environments for reinforcement learning
Author's PyTorch implementation of TD3 for OpenAI gym tasks
Code release for Learning with Opponent-Learning Awareness and variations.
A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.
PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.
Code for the Lovász-Softmax loss (CVPR 2018)
Magenta: Music and Art Generation with Machine Intelligence
An implementation of the Augmented Random Search algorithm
PyTorch implementation of the DDPG algorithm for continuous-action reinforcement learning problems.
PyTorch implementation of Hinton's Dynamic Routing Between Capsules
A PyTorch implementation of the NIPS 2017 paper "Dynamic Routing Between Capsules".
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKT…
😇 A PyTorch implementation of the DeepMoji model: a state-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm, etc.
Learning embeddings for classification, retrieval and ranking.
A working implementation of the Categorical DQN (Distributional RL).
Implement A3C for Mujoco gym envs
Reinforcement learning environments with musculoskeletal models
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".