Skip to content
View Alian3785's full-sized avatar

Block or report Alian3785

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A mini-framework for evaluating LLM performance on the Bulls and Cows number guessing game, supporting multiple LLM providers.

HTML 237 1 Updated Jan 31, 2025

Dependency Manager for PHP

PHP 28,877 4,602 Updated Mar 6, 2025

Distributed RL framework for solving the SoulsGym environments

Python 30 2 Updated Apr 28, 2024

Simplifying reinforcement learning for complex game environments

C 1,729 94 Updated Mar 12, 2025

JAX-accelerated Meta-Reinforcement Learning Environments Inspired by XLand and MiniGrid 🏎️

Python 267 19 Updated Nov 16, 2024

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 31,612 2,936 Updated Mar 12, 2025

🕹️ A diverse suite of scalable reinforcement learning environments in JAX

Python 701 89 Updated Feb 4, 2025

An API to support the playing and analysis of games of The Royal Game of Ur.

Python 5 3 Updated Feb 27, 2024

♟️ Vectorized RL game environments in JAX

Python 451 33 Updated Mar 6, 2025

Visualization of MCTS algorithm applied to Tic-tac-toe.

JavaScript 228 12 Updated Aug 25, 2021

The MAX Platform (includes Mojo)

Mojo 23,773 2,586 Updated Mar 11, 2025

Gymnasium extension for DarkSouls III, Elden Ring, and other Souls games

Python 125 11 Updated Oct 20, 2024

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python 8,547 954 Updated Mar 6, 2025

Generative Agents: Interactive Simulacra of Human Behavior

18,601 2,461 Updated Aug 5, 2024

Distributed Reinforcement Learning accelerated by Lightning Fabric

Python 351 44 Updated Mar 10, 2025

Emu Series: Generative Multimodal Models from BAAI

Python 1,692 87 Updated Sep 27, 2024

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Python 1,300 141 Updated Mar 11, 2025

python курс

37 6 Updated Mar 8, 2025

Mastering Diverse Domains through World Models

Python 1,536 259 Updated Feb 22, 2025

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 10,052 1,787 Updated Mar 6, 2025

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,168 143 Updated Aug 3, 2023

(JAIR'2022) A mini-scale reproduction code of the AlphaStar program. Note: the original AlphaStar is the AI proposed by DeepMind to play StarCraft II. JAIR = Journal of Artificial Intelligence Rese…

Python 328 57 Updated Nov 9, 2022

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,935 6,092 Updated Mar 12, 2025

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

C++ 1,132 106 Updated Aug 12, 2024

Keeping track of RL experiments

162 25 Updated Dec 17, 2022

A simple and highly efficient RTS-game-inspired environment for reinforcement learning

Java 297 110 Updated Jul 1, 2024

A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)

Python 239 53 Updated Jul 1, 2024

DI-engine docs (Chinese and English)

Python 295 63 Updated Mar 10, 2025

POGEMA stands for Partially-Observable Grid Environment for Multiple Agents. This is a grid-based environment that was specifically designed to be flexible, tunable and scalable. It can be tailored…

Python 217 25 Updated Sep 18, 2024
Next