Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

Python 2,521 319 Updated May 10, 2025

XU-YIJIE / grpo-flat

Train your GRPO with zero dataset and low resources; 8-bit/4-bit/LoRA/QLoRA supported, multi-GPU supported ...

Python 71 7 Updated Apr 30, 2025

wnzhang / rtb-papers

A collection of research and survey papers of real-time bidding (RTB) based display advertising techniques.

3,636 936 Updated Dec 20, 2024

huangsg1 / Internet-advertising-mechanism-and-strategy

A compilation of materials on computational advertising mechanisms and strategies (a collection of research and application papers about strategy in Internet advertising).

161 22 Updated Feb 18, 2024

mml-book / mml-book.github.io

Companion webpage to the book "Mathematics For Machine Learning"

Jupyter Notebook 14,121 2,563 Updated Mar 13, 2025

sweetice / Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 4,306 876 Updated Mar 24, 2023

nobodymx / SORL-for-Auto-bidding

Code for the SORL framework for auto-bidding

Python 40 11 Updated Oct 13, 2022

alimama-tech / AuctionNet

AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games

Python 158 12 Updated Apr 25, 2025

datawhalechina / easy-rl

Reinforcement learning tutorial in Chinese (the "Mushroom Book" 🍄). Read online: https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 11,350 2,029 Updated May 13, 2025

wwxFromTju / awesome-reinforcement-learning-lib

GitHub's code repository is all you need

349 42 Updated Mar 21, 2023

Doragd / Algorithm-Practice-in-Industry

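A collection of industry practice articles on search, recommendation, advertising, user growth, and related areas (sources: Zhihu, Datafuntalk, and technical WeChat official accounts)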

Python 3,387 391 Updated May 23, 2025

venkatacrc / Budget_Constrained_Bidding

Budget Constrained Bidding for Display Advertising using Model-free Reinforcement Learning

Python 42 20 Updated Dec 13, 2019

liguodongiot / llm-action

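This project shares the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications)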

HTML 17,801 2,097 Updated May 1, 2025

cbamls / AI_Tutorial

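Curated learning materials for AI fields such as machine learning, NLP, image recognition, and deep learning; a compilation of architecture and algorithm materials for search, recommendation, and advertising systems; and collected notes from leading algorithm practitioners.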

3,434 503 Updated Apr 15, 2024

datawhalechina / leedl-tutorial

"Hung-yi Lee's Deep Learning Tutorial" (recommended by Prof. Hung-yi Lee 👍; the "Apple Book" 🍎). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases

Jupyter Notebook 15,142 3,025 Updated May 13, 2025

zhaoweih / Clash-Copy

📁 Clash client backup

403 199 Updated Nov 28, 2024

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 52,217 5,581 Updated May 12, 2025

udacity / deep-reinforcement-learning

Repo for the Deep Reinforcement Learning Nanodegree program

Jupyter Notebook 5,048 2,373 Updated Nov 16, 2023

lyhue1991 / eat_pytorch_in_20_days

Pytorch🍊🍉 is delicious, just eat it! 😋😋

Jupyter Notebook 5,770 1,220 Updated Feb 25, 2025

udlbook / udlbook

Understanding Deep Learning - Simon J.D. Prince

Jupyter Notebook 7,480 1,622 Updated May 22, 2025

jvpoulos / causal-ml

Must-read papers and resources related to causal inference and machine (deep) learning

714 130 Updated Nov 23, 2022

kochbj / Deep-Learning-for-Causal-Inference

Extensive tutorials for learning how to build deep learning models for causal inference (HTE) using selection on observables in TensorFlow 2 and PyTorch.

326 72 Updated Oct 17, 2024

rguo12 / awesome-causality-algorithms

An index of algorithms for learning causality with data

3,138 468 Updated Jan 22, 2025

dair-ai / ML-Papers-of-the-Week

🔥Highlighting the top ML papers every week.

11,271 686 Updated Apr 11, 2025

yct yongchengtao


Starred repositories

Applied-Machine-Learning-Lab / GAVE

Cardinal-Operations / ORLM

nikhilbarhate99 / PPO-PyTorch

AI4Finance-Foundation / ElegantRL

xlite-dev / LeetCUDA

magicwt / ad-papers

XinJingHao / DRL-Pytorch