Fortuna Yijie98

4 followers · 33 following

Starred repositories

86 stars written in Python

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python · 34,949 stars · 5,329 forks · Updated Jan 27, 2025

deepseek-ai / DeepSeek-V3

Python · 32,770 stars · 3,271 forks · Updated Jan 26, 2025

ShangtongZhang / reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Python · 13,767 stars · 4,858 forks · Updated Aug 9, 2024

lukas-blecher / LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python · 13,322 stars · 1,069 forks · Updated Jan 18, 2025

DLR-RM / stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python · 9,610 stars · 1,742 forks · Updated Jan 27, 2025

thu-ml / tianshou

An elegant PyTorch deep reinforcement learning library.

Python · 8,144 stars · 1,126 forks · Updated Jan 26, 2025

thuml / Time-Series-Library

A Library for Advanced Deep Time Series Models.

Python · 7,810 stars · 1,250 forks · Updated Jan 10, 2025

py-why / dowhy

DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphic…

Python · 7,246 stars · 941 forks · Updated Jan 21, 2025

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python · 6,129 stars · 691 forks · Updated Jan 23, 2025

uber / causalml

Uplift modeling and causal inference with machine learning algorithms

Python · 5,190 stars · 789 forks · Updated Jan 10, 2025

online-ml / river

🌊 Online machine learning in Python

Python · 5,171 stars · 552 forks · Updated Dec 6, 2024

diego-vicente / som-tsp

Solving the Traveling Salesman Problem using Self-Organizing Maps

Python · 3,886 stars · 607 forks · Updated Dec 24, 2023

rlabbe / filterpy

Python Kalman filtering and optimal estimation library. Implements Kalman filter, particle filter, Extended Kalman filter, Unscented Kalman filter, g-h (alpha-beta), least squares, H Infinity, smoo…

Python · 3,435 stars · 630 forks · Updated Feb 7, 2024

kzl / decision-transformer

Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

Python · 2,462 stars · 460 forks · Updated Apr 29, 2024

Thinklab-SJTU / awesome-ml4co

Awesome machine learning for combinatorial optimization papers.

Python · 1,776 stars · 203 forks · Updated Sep 5, 2024

HumanCompatibleAI / imitation

Clean PyTorch implementations of imitation and reward learning algorithms

Python · 1,384 stars · 256 forks · Updated Jan 7, 2025

google / active-learning

Python · 1,134 stars · 207 forks · Updated Dec 5, 2022

huawei-noah / trustworthyAI

Trustworthy AI related projects

Python · 1,002 stars · 219 forks · Updated Jan 9, 2025

LongxingTan / Time-series-prediction

tfts: Time Series Deep Learning Models in TensorFlow

Python · 835 stars · 165 forks · Updated Jan 25, 2025

iMoonLab / HGNN

Hypergraph Neural Networks (AAAI 2019)

Python · 708 stars · 137 forks · Updated Aug 31, 2022

martius-lab / blackbox-backprop

Torch modules that wrap blackbox combinatorial solvers according to the method presented in "Differentiating Blackbox Combinatorial Solvers"

Python · 341 stars · 40 forks · Updated Dec 21, 2021

FilippoAiraldi / mpc-reinforcement-learning

Reinforcement Learning with Model Predictive Control

Python · 339 stars · 46 forks · Updated Jan 20, 2025

yihaosun1124 / OfflineRL-Kit

An elegant PyTorch offline reinforcement learning library for researchers.

Python · 298 stars · 35 forks · Updated Apr 17, 2024

mohmdelsayed / streaming-drl

Deep reinforcement learning without experience replay, target networks, or batch updates.

Python · 207 stars · 15 forks · Updated Jan 12, 2025

JoshVarty / AlphaZeroSimple

The absolute most basic example of AlphaZero and Monte Carlo Tree Search I could come up with

Python · 201 stars · 34 forks · Updated Apr 3, 2023

forgi86 / pyMPC

A Model Predictive Control (MPC) Python library based on the OSQP solver.

Python · 200 stars · 30 forks · Updated Jul 19, 2021

alison-carrera / onn

Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)

Python · 183 stars · 45 forks · Updated Dec 11, 2019

huggingface / jat

General multi-task deep RL Agent

Python · 174 stars · 11 forks · Updated Jun 6, 2024

thuml / TimeXer

Official implementation for "TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables" (NeurIPS 2024)

Python · 155 stars · 21 forks · Updated Nov 27, 2024

ugr-sail / sinergym

Gym environment for building simulation and control using reinforcement learning

Python · 142 stars · 40 forks · Updated Jan 27, 2025

Starred topics

paper