Skip to content
View sijiaxu's full-sized avatar

Block or report sijiaxu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
25 stars written in Python
Clear filter

🙌 OpenHands: Code Less, Make More

Python 49,021 5,382 Updated Mar 7, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 43,245 5,294 Updated Mar 6, 2025

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,872 4,059 Updated Jul 17, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,732 1,892 Updated Apr 30, 2024

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,082 4,890 Updated Aug 1, 2024

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,407 2,254 Updated Feb 1, 2025

An educational resource to help anyone learn deep reinforcement learning.

Python 10,615 2,288 Updated Aug 5, 2024

StarCraft II Learning Environment

Python 8,085 1,158 Updated Jul 23, 2024

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Python 7,156 577 Updated Sep 23, 2024

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6,228 557 Updated Oct 24, 2024

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Python 3,039 525 Updated May 9, 2024

Democratizing Reinforcement Learning for LLMs

Python 1,913 168 Updated Feb 16, 2025

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,349 129 Updated Feb 26, 2025

StarCraft II - pysc2 Deep Reinforcement Learning Examples

Python 758 354 Updated Mar 3, 2021

Using Keras and Deep Deterministic Policy Gradient to play TORCS

Python 721 266 Updated Dec 4, 2017

Starcraft AI Research Dataset

Python 574 75 Updated Aug 30, 2021

A few data mining algorithms in pure python

Python 467 109 Updated Oct 29, 2015

Dota2 AI bot

Python 405 48 Updated Aug 11, 2021

Accompanying repository for Let's make a DQN / A3C series.

Python 394 172 Updated Sep 4, 2018

Dota 2 Python AI

Python 99 9 Updated Dec 4, 2018

A fast single-direction queue for multiprocessing.

Python 35 10 Updated Aug 12, 2019

distributed RL spaghetti al arabiata

Python 28 7 Updated Mar 29, 2019

PyDota2 Framework Integrated with DotaService

Python 25 5 Updated Jan 17, 2019

基于中文 GPT2 预训练模型的文本分类微调

Python 21 Updated Mar 29, 2023