Stars
A mini-framework for evaluating LLM performance on the Bulls and Cows number-guessing game, supporting multiple LLM providers.
Distributed RL framework for solving the SoulsGym environments
Simplifying reinforcement learning for complex game environments
JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
🕹️ A diverse suite of scalable reinforcement learning environments in JAX
An API to support the playing and analysis of games of The Royal Game of Ur.
Visualization of the MCTS algorithm applied to tic-tac-toe.
Gymnasium extension for Dark Souls III, Elden Ring, and other Souls games
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
Generative Agents: Interactive Simulacra of Human Behavior
Distributed Reinforcement Learning accelerated by Lightning Fabric
Emu Series: Generative Multimodal Models from BAAI
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Mastering Diverse Domains through World Models
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC
(JAIR'2022) A mini-scale reproduction of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play StarCraft II. JAIR = Journal of Artificial Intelligence Rese…
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
A simple and highly efficient RTS-game-inspired environment for reinforcement learning
A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)
POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically designed to be flexible, tunable and scalable. It can be tailored…