
Inference code for Llama models

Python 57,520 9,693 Updated Jan 26, 2025

A programmer's guide to cooking at home (Simplified Chinese only).

Dockerfile 68,900 8,814 Updated Feb 3, 2025

A list of Free Software network services and web applications which can be hosted on your own servers

215,449 10,195 Updated Feb 5, 2025

This is the official implementation of Multi-Agent PPO (MAPPO).

Python 1,414 314 Updated Jul 18, 2024

Code accompanying the paper "RODE: Learning Roles to Decompose Multi-Agent Tasks" (ICLR 2021, https://arxiv.org/abs/2010.01523). RODE is a scalable role-based multi-agent learning method which effe…

Python 71 20 Updated Dec 17, 2024

An elegant \LaTeX\ résumé template. Mainland China mirror: https://gods.coding.net/p/resume/git

TeX 9,478 2,635 Updated Mar 15, 2024

PyTorch implementations of Generative Adversarial Networks.

Python 16,719 4,098 Updated Jun 18, 2024
C 2 7 Updated Dec 21, 2020

A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)

Python 3,585 523 Updated Jan 6, 2025

Simple implementation of Conditional Random Fields (CRF) in Python. A faster, more powerful, Cython implementation is available in the vocrf project https://github.com/timvieira/vocrf

Python 342 116 Updated Sep 7, 2021

Code accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020, https://arxiv.org/abs/2003.08039)

Python 154 34 Updated Dec 8, 2022

Multi Agent Reinforcement Learning using MalmÖ

Python 246 46 Updated Apr 14, 2020

A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.

Go 90,187 13,684 Updated Feb 7, 2025

VS Code in the browser

TypeScript 69,681 5,754 Updated Feb 1, 2025

An elegant PyTorch deep reinforcement learning library.

Python 8,162 1,125 Updated Jan 26, 2025

A docker image which provides openconnect with proxy

Dockerfile 1 Updated Mar 12, 2020

PyTorch implementations of deep reinforcement learning algorithms and environments

Python 5,703 1,202 Updated Jul 25, 2024

StarCraft II Client - protocol definitions used to communicate with StarCraft II.

Python 3,821 434 Updated Dec 4, 2024

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Python 1,506 286 Updated Sep 8, 2022

FEN Code

Python 37 8 Updated Nov 4, 2019

Hello, I pushed some Python environments for multi-agent reinforcement learning.

Python 688 127 Updated May 23, 2022

An interface with micropolis for city-building agents, packaged as an OpenAI gym environment

C 144 18 Updated Oct 12, 2021

LaTeX code for drawing neural network diagrams

TeX 22,669 2,910 Updated Aug 21, 2023

Reinforcement Learning in PyTorch

Python 2,239 327 Updated Jan 4, 2021

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

Python 1,682 311 Updated Jul 30, 2024

A tutorial on basic probability distributions for deep learning researchers

Python 1,628 387 Updated Oct 1, 2020

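This project covers the concepts and code implementations most often tested in machine learning, deep learning, and NLP interviews, which are also the theoretical foundations every algorithm engineer should know.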

Jupyter Notebook 16,306 4,581 Updated Jun 21, 2022

Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"

Python 1,599 265 Updated Jul 21, 2023

Reproduce some methods in semi-supervised papers.

Python 37 2 Updated Jul 9, 2019