High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 2 1 Updated Apr 3, 2025

distantmagic / aloni

Async Python framework optimized for IO-heavy applications.

Python 3 Updated Jun 29, 2024

emmett-framework / granian

A Rust HTTP server for Python applications

Rust 3,424 99 Updated Apr 21, 2025

moucheng2017 / SOP-LVM-ICL-Ensemble

[NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Understanding

Python 22 3 Updated Mar 16, 2025

eza-community / eza

A modern alternative to ls

Rust 15,078 286 Updated Apr 23, 2025

EdanToledo / Stoix

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 318 33 Updated Apr 23, 2025

dair-ai / ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

11,140 680 Updated Apr 11, 2025

LucMc / Website

My personal website

CSS 1 Updated Apr 1, 2021

CAV-Research-Lab / Predictive-Model-Delay-Correction

Python 2 Updated Apr 21, 2023

CAV-Research-Lab / cav-research-lab.github.io

JavaScript 2 Updated Jan 31, 2024

jimmy-academia / AlphaZero-Pytorch-Go-Mahjong

My AlphaZero implementations with Pytorch! (Game of Go; Mahjong)

Python 4 Updated Jan 30, 2020

parinazfa / Recent-Trends-in-Teleoperation-Time-DelayMitigation

Focus on time series prediction methods to solve time delay in teleoperation

Jupyter Notebook 5 Updated Jun 6, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Luc McCutcheon LucMc

Achievements

Achievements

Highlights

Block or report LucMc

Stars

LucMc / PPO-JAX

github / gitignore

rainx0r / metaworld-algorithms

dorjeduck / efficient-kan-jax

shibhansh / loss-of-plasticity

yingchengyang / Reinforcement-Learning-Papers

CAV-Research-Lab / SACLA

rainx0r / mtrl

rainx0r / launchpad

LucMc / mtrl

LucMc / WebCrawler

pytorch-labs / LeanRL

reginald-mclean / cleanrl