Starred repositories
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Implementation of Reinforcement Learning: An Introduction
pix2tex: Using a ViT to convert images of equations into LaTeX code.
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
An elegant PyTorch deep reinforcement learning library.
A Library for Advanced Deep Time Series Models.
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphic…
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Uplift modeling and causal inference with machine learning algorithms
Solving the Traveling Salesman Problem using Self-Organizing Maps
Python Kalman filtering and optimal estimation library. Implements Kalman filter, particle filter, Extended Kalman filter, Unscented Kalman filter, g-h (alpha-beta), least squares, H Infinity, smoo…
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Awesome machine learning for combinatorial optimization papers.
Clean PyTorch implementations of imitation and reward learning algorithms
tfts: Time Series Deep Learning Models in TensorFlow
Torch modules that wrap blackbox combinatorial solvers according to the method presented in "Differentiating Blackbox Combinatorial Solvers"
Reinforcement Learning with Model Predictive Control
An elegant PyTorch offline reinforcement learning library for researchers.
Deep reinforcement learning without experience replay, target networks, or batch updates.
The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with
A Model Predictive Control (MPC) Python library based on the OSQP solver.
Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
Official implementation for "TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables" (NeurIPS 2024)
Gym environment for building simulation and control using reinforcement learning