Skip to content
View KeFttan's full-sized avatar
🍀
🍀

Block or report KeFttan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The implement of all kinds of dqn reinforcement learning with Pytorch

Python 94 22 Updated Mar 25, 2021

Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL

Jupyter Notebook 3,079 592 Updated Nov 4, 2021

Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.

Python 11 3 Updated Mar 13, 2022

PFRL: a PyTorch-based deep reinforcement learning library

Python 1,223 160 Updated Aug 4, 2024

DI-engine docs (Chinese and English)

Python 296 63 Updated Mar 10, 2025

XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library

Python 782 120 Updated Mar 14, 2025

PyTorch implementation of QR-DQN: Distributional Reinforcement Learning with Quantile Regression

Jupyter Notebook 26 10 Updated Aug 16, 2020

PyTorch implementation of FQF, IQN and QR-DQN.

Python 171 27 Updated Jul 25, 2024

This is the official code release of the following paper: Hao Dong et al., Temporal Inductive Path Neural Network for Temporal Knowledge Graph Reasoning.

Python 26 1 Updated Feb 15, 2024

Awesome papers about machine learning (deep learning) on dynamic (temporal) graphs (networks / knowledge graphs).

Shell 630 83 Updated Dec 20, 2024

Implementation of Truncated Quantile Critics method for continuous reinforcement learning. https://bayesgroup.github.io/tqc/

Python 93 16 Updated Feb 16, 2021

Influence maximization in unknown social networks: Learning Policies for Effective Graph Sampling (official code repository)

Python 28 10 Updated Jul 18, 2022

Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.

TypeScript 8,428 391 Updated Mar 17, 2025

Code for the paper Fine-Tuning Language Models from Human Preferences

Python 1,300 168 Updated Jul 25, 2023

Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.

HTML 989 99 Updated Apr 27, 2024

本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)

HTML 15,494 1,790 Updated Mar 2, 2025

Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复

Python 18,802 1,950 Updated Apr 4, 2024

A list of papers regarding generalization in (deep) reinforcement learning

11 1 Updated Aug 13, 2023

Inference code for LLaMA models

Python 118 27 Updated Aug 13, 2023

Long-Term Evolution Project of Reinforcement Learning

Python 470 86 Updated Jan 4, 2025

ChatGPT资料汇总学习,持续更新......

4,134 385 Updated Nov 30, 2024

A Gradio web UI for Large Language Models with support for multiple inference backends.

Python 42,937 5,534 Updated Mar 17, 2025

A pytorch adversarial library for attack and defense methods on images and graphs

Python 1,024 193 Updated Jul 23, 2024

A series of large language models developed by Baichuan Intelligent Technology

Python 4,127 299 Updated Nov 8, 2024

Deep Reinforcement Learning with pytorch & visdom

Python 799 143 Updated Jul 16, 2020

Pytorch🍊🍉 is delicious, just eat it! 😋😋

Jupyter Notebook 5,605 1,199 Updated Feb 25, 2025

This collection of papers can be used to summarize research about graph reinforcement learning for the convenience of researchers.

174 22 Updated Nov 29, 2024

This is the official code release of the following paper: Hao Dong et al., Adaptive Path-Memory Network for Temporal Knowledge Graph Reasoning.

Python 18 Updated Jan 31, 2024

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB 7,648 800 Updated Mar 20, 2025
Next