Skip to content
View LucMc's full-sized avatar

Highlights

  • Pro

Block or report LucMc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This repository aims to provide the minimalism of cleanRL with the performance of SBX

Python 2 Updated Apr 24, 2025

A collection of useful .gitignore templates

165,979 83,073 Updated Apr 11, 2025

Implementations of Multi-Task and Meta-Learning baselines for the Metaworld benchmark

Python 4 Updated Apr 24, 2025

JAX port of efficient-kan

Python 4 Updated Jun 24, 2024

Demonstrations of Loss of Plasticity and Implementation of Continual Backpropagation

Python 299 59 Updated Nov 3, 2024

Related papers for reinforcement learning, including classic papers and latest papers in top conferences

410 28 Updated Mar 26, 2025

Neural-Lyapunov-Approximation-SACLA

Python 4 Updated Apr 1, 2025

Multi-task reinforcement learning research

Python 1 2 Updated Mar 20, 2025

Docker images for my ML projects

Dockerfile 1 1 Updated Mar 31, 2025
Python 1 Updated Jan 20, 2025
Python 1 Updated Nov 26, 2024

LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.

Python 547 23 Updated Oct 26, 2024

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 2 1 Updated Apr 3, 2025

Async Python framework optimized for IO-heavy applications.

Python 3 Updated Jun 29, 2024

A Rust HTTP server for Python applications

Rust 3,424 99 Updated Apr 21, 2025

[NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Understanding

Python 22 3 Updated Mar 16, 2025

A modern alternative to ls

Rust 15,078 286 Updated Apr 23, 2025

🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL

Python 318 33 Updated Apr 23, 2025

🔥Highlighting the top ML papers every week.

11,140 680 Updated Apr 11, 2025

My personal website

CSS 1 Updated Apr 1, 2021

My AlphaZero implementations with Pytorch! (Game of Go; Mahjong)

Python 4 Updated Jan 30, 2020

Focus on time series prediction methods to solve time delay in teleoperation

Jupyter Notebook 5 Updated Jun 6, 2020