-
Rutgers University
- New Brunswick
- herowanzhu.github.io
Stars
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
A comprehensive list of PAPERS, CODEBASES, and, DATASETS on Decision Making using Foundation Models including LLMs and VLMs.
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
🚀 A fast safe reinforcement learning library in PyTorch
RLeXplore provides stable baselines of exploration methods in reinforcement learning, such as intrinsic curiosity module (ICM), random network distillation (RND) and rewarding impact-driven explora…
The repository is for safe reinforcement learning baselines.
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
A collection of reference environments for offline reinforcement learning
Find best-response to a fixed policy in multi-agent RL
A fast trajectory optimization library written in Julia
Guaranteed Sequential Trajectory Optimization (GuSTO), using sequential convex programming for trajectory optimization with strong theoretical guarantees
Distributed reliable key-value store for the most critical data of a distributed system
Hypothesis is a powerful, flexible, and easy to use library for property-based testing.
Generative Adversarial Network to Develop Synthetic Dialogues
Podman: A tool for managing OCI containers and pods.
Simple operating system in C++, written from scratch
Modern transactional key-value/row storage library.