Skip to content
View yongchengtao's full-sized avatar

Block or report yongchengtao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 8 Updated Jan 24, 2025

ORLM: Training Large Language Models for Optimization Modeling

Python 152 21 Updated Apr 3, 2025

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Python 2,037 378 Updated Jul 9, 2024

Massively Parallel Deep Reinforcement Learning. 🔥

Python 4,035 909 Updated May 8, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥

Cuda 4,428 467 Updated May 17, 2025
13 2 Updated Nov 11, 2024

Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)

Python 2,521 319 Updated May 10, 2025

Train your grpo with zero dataset and low resources, 8bit/4bit/lora/qlora supported, multi-gpu supported ...

Python 71 7 Updated Apr 30, 2025

A collection of research and survey papers of real-time bidding (RTB) based display advertising techniques.

3,636 936 Updated Dec 20, 2024

计算广告机制策略相关材料整理(A collection of research and application papers about Strategy in Internet advertising.)

161 22 Updated Feb 18, 2024

Companion webpage to the book "Mathematics For Machine Learning"

Jupyter Notebook 14,121 2,563 Updated Mar 13, 2025

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

Python 4,306 876 Updated Mar 24, 2023

codes for SORL framework for auto-bidding

Python 40 11 Updated Oct 13, 2022

AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games

Python 158 12 Updated Apr 25, 2025

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 11,350 2,029 Updated May 13, 2025

GitHub's code repository is all you need

349 42 Updated Mar 21, 2023

搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)

Python 3,387 391 Updated May 23, 2025

Budget Constrained Bidding for Display Advertising using Model-free Reinforcement Learning

Python 42 20 Updated Dec 13, 2019

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 17,801 2,097 Updated May 1, 2025

精选机器学习,NLP,图像识别, 深度学习等人工智能领域学习资料,搜索,推荐,广告系统架构及算法技术资料整理。算法大牛笔记汇总

3,434 503 Updated Apr 15, 2024

《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases

Jupyter Notebook 15,142 3,025 Updated May 13, 2025

📁Clash客户端备份-Clash client backup

403 199 Updated Nov 28, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 52,217 5,581 Updated May 12, 2025

Repo for the Deep Reinforcement Learning Nanodegree program

Jupyter Notebook 5,048 2,373 Updated Nov 16, 2023

Pytorch🍊🍉 is delicious, just eat it! 😋😋

Jupyter Notebook 5,770 1,220 Updated Feb 25, 2025

Understanding Deep Learning - Simon J.D. Prince

Jupyter Notebook 7,480 1,622 Updated May 22, 2025

Must-read papers and resources related to causal inference and machine (deep) learning

714 130 Updated Nov 23, 2022

Extensive tutorials for learning how to build deep learning models for causal inference (HTE) using selection on observables in Tensorflow 2 and Pytorch.

326 72 Updated Oct 17, 2024

An index of algorithms for learning causality with data

3,138 468 Updated Jan 22, 2025

🔥Highlighting the top ML papers every week.

11,271 686 Updated Apr 11, 2025
Next