sijiaxu

sijia xu sijiaxu

10 followers · 4 following

Shanghai, China

Achievements

Stars

25 stars written in Python

All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More

Python 49,021 5,382 Updated Mar 7, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 43,245 5,294 Updated Mar 6, 2025

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,872 4,059 Updated Jul 17, 2024

yoheinakajima / babyagi

Python 21,124 2,775 Updated Nov 6, 2024

ymcui / Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment

Python 18,732 1,892 Updated Apr 30, 2024

openai / baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,082 4,890 Updated Aug 1, 2024

horovod / horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,407 2,254 Updated Feb 1, 2025

openai / spinningup

An educational resource to help anyone learn deep reinforcement learning.

Python 10,615 2,288 Updated Aug 5, 2024

google-deepmind / pysc2

StarCraft II Learning Environment

Python 8,085 1,158 Updated Jul 23, 2024

ymcui / Chinese-LLaMA-Alpaca-2

Chinese LLaMA-2 & Alpaca-2 LLMs (phase two of the project), with 64K long-context models

Python 7,156 577 Updated Sep 23, 2024

yangjianxin1 / Firefly

Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

Python 6,228 557 Updated Oct 24, 2024

dbiir / UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Python 3,039 525 Updated May 9, 2024

agentica-project / deepscaler

Democratizing Reinforcement Learning for LLMs

Python 1,913 168 Updated Feb 16, 2025

showlab / computer_use_ootb

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,349 129 Updated Feb 26, 2025

chris-chris / pysc2-examples

StarCraft II - pysc2 Deep Reinforcement Learning Examples

Python 758 354 Updated Mar 3, 2021

yanpanlau / DDPG-Keras-Torcs

Using Keras and Deep Deterministic Policy Gradient to play TORCS

Python 721 266 Updated Dec 4, 2017

TorchCraft / StarData

Starcraft AI Research Dataset

Python 574 75 Updated Aug 30, 2021

bartdag / pymining

A few data mining algorithms in pure python

Python 467 109 Updated Oct 29, 2015

bilibili / LastOrder-Dota2

Dota2 AI bot

Python 405 48 Updated Aug 11, 2021

jaromiru / AI-blog

Accompanying repository for Let's make a DQN / A3C series.

Python 394 172 Updated Sep 4, 2018

pydota2 / pydota2_archive

Dota 2 Python AI

Python 99 9 Updated Dec 4, 2018

WeiTang114 / FMQ

A fast single-direction queue for multiprocessing.

Python 35 10 Updated Aug 12, 2019

TimZaman / dotaclient

distributed RL spaghetti al arabiata

Python 28 7 Updated Mar 29, 2019

pydota2 / pydota2

PyDota2 Framework Integrated with DotaService

Python 25 5 Updated Jan 17, 2019

zejunwang1 / gpt2classifier

Fine-tuning for text classification based on a Chinese GPT2 pre-trained model

Python 21 Updated Mar 29, 2023