JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"
Mastering Diverse Domains through World Models
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
SBX: Stable Baselines Jax (SB3 + JAX) RL algorithms
[NeurIPS'22 Spotlight] When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning
PyTorch implementation of SAC-Discrete.
Implementation of Trajectory Transformer with attention caching and batched beam search