Skip to content
View sijiaxu's full-sized avatar

Block or report sijiaxu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Democratizing Reinforcement Learning for LLMs

Python 1,912 167 Updated Feb 16, 2025

🙌 OpenHands: Code Less, Make More

Python 49,004 5,378 Updated Mar 6, 2025

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,349 129 Updated Feb 26, 2025

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,871 4,060 Updated Jul 17, 2024

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Python 7,156 577 Updated Sep 23, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,732 1,892 Updated Apr 30, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 43,233 5,292 Updated Mar 6, 2025

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,228 557 Updated Oct 24, 2024

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Python 3,039 525 Updated May 9, 2024

基于中文 GPT2 预训练模型的文本分类微调

Python 21 Updated Mar 29, 2023

Redis Cluster Daily Maintenance Tool/Redis集群日常运维工具

Shell 351 42 Updated Nov 18, 2022

Dota2 AI bot

Python 405 48 Updated Aug 11, 2021

A fast single-direction queue for multiprocessing.

Python 35 10 Updated Aug 12, 2019

ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation

C++ 3,388 567 Updated Jun 21, 2019

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,408 2,254 Updated Feb 1, 2025

An educational resource to help anyone learn deep reinforcement learning.

Python 10,615 2,289 Updated Aug 5, 2024

Dota 2 Addon for Creep Block Episodic Reinforcement Learning

Lua 36 6 Updated Aug 24, 2017

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,082 4,891 Updated Aug 1, 2024

Work in progress for a full-overwrite Dota 2 bot framework

Lua 98 39 Updated Jun 12, 2017

distributed RL spaghetti al arabiata

Python 28 7 Updated Mar 29, 2019

Dota 2 API for machine learning

C++ 22 Updated Dec 13, 2018

Dota 2 Python AI

Python 99 9 Updated Dec 4, 2018

DotaService is a service to play Dota 2 through gRPC

Lua 119 19 Updated Feb 18, 2024

PyDota2 Framework Integrated with DotaService

Python 25 5 Updated Jan 17, 2019

StarCraft AI bot

C++ 63 21 Updated Dec 4, 2018

Just another Hearthstone Simulator in C# .Net Core, with some A.I. approaches!

C# 256 100 Updated Dec 8, 2022

Accompanying repository for Let's make a DQN / A3C series.

Python 394 172 Updated Sep 4, 2018

Connecting Torch to StarCraft

C++ 1,388 210 Updated Aug 30, 2021

Starcraft AI Research Dataset

Python 574 75 Updated Aug 30, 2021
Next