High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,168 143 Updated Aug 3, 2023

liuruoze / mini-AlphaStar

(JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play StarCraft II. JAIR = Journal of Artificial Intelligence Rese…

Python 328 57 Updated Nov 9, 2022

ray-project / ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,935 6,092 Updated Mar 12, 2025

sail-sg / envpool

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

C++ 1,132 106 Updated Aug 12, 2024

ray-project / rl-experiments

Keeping track of RL experiments

162 25 Updated Dec 17, 2022

Farama-Foundation / MicroRTS

A simple and highly efficient RTS-game-inspired environment for reinforcement learning

Java 297 110 Updated Jul 1, 2024

Farama-Foundation / MicroRTS-Py

A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)

Python 239 53 Updated Jul 1, 2024

opendilab / DI-engine-docs

DI-engine docs (Chinese and English)

Python 295 63 Updated Mar 10, 2025

Cognitive-AI-Systems / pogema

POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically designed to be flexible, tunable and scalable. It can be tailored…

Python 217 25 Updated Sep 18, 2024

Sergey Barichev Alian3785

Stars

stalkermustang / llm-bulls-and-cows-benchmark

composer / composer

amacati / SoulsAI

PufferAI / PufferLib

dunnolab / xland-minigrid

jax-ml / jax

instadeepai / jumanji

RoyalUr / RoyalUr-Python

sotetsuk / pgx

vgarciasc / mcts-viz

modular / max

amacati / SoulsGym

Farama-Foundation / Gymnasium

joonspk-research / generative_agents

Eclectic-Sheep / sheeprl

baaivision / Emu

opendilab / LightZero

ivankunyankin / ml-agents-frozen-lake

open-data-science / pycourse

danijar / dreamerv3

DLR-RM / stable-baselines3

tinkoff-ai / CORL