sijiaxu

sijia xu sijiaxu

10 followers · 4 following

Shanghai, China

Achievements

Stars

agentica-project / deepscaler

Democratizing Reinforcement Learning for LLMs

Python 1,912 167 Updated Feb 16, 2025

All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More

Python 49,004 5,378 Updated Mar 6, 2025

showlab / computer_use_ootb

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,349 129 Updated Feb 26, 2025

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,871 4,060 Updated Jul 17, 2024

ymcui / Chinese-LLaMA-Alpaca-2

Chinese LLaMA-2 & Alpaca-2 LLMs, phase two of the project, with 64K long-context models

Python 7,156 577 Updated Sep 23, 2024

ymcui / Chinese-LLaMA-Alpaca

Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment

Python 18,732 1,892 Updated Apr 30, 2024

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 43,233 5,292 Updated Mar 6, 2025

yangjianxin1 / Firefly

Firefly: a training tool for large models, with support for Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models

Python 6,228 557 Updated Oct 24, 2024

dbiir / UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Python 3,039 525 Updated May 9, 2024

zejunwang1 / gpt2classifier

Fine-tuning for text classification based on a Chinese GPT2 pre-trained model

Python 21 Updated Mar 29, 2023

yoheinakajima / babyagi

Python 21,123 2,775 Updated Nov 6, 2024

iwhalecloud-platform / redis-tool

Redis Cluster Daily Maintenance Tool

Shell 351 42 Updated Nov 18, 2022

bilibili / LastOrder-Dota2

Dota2 AI bot

Python 405 48 Updated Aug 11, 2021

WeiTang114 / FMQ

A fast single-direction queue for multiprocessing.

Python 35 10 Updated Aug 12, 2019

pytorch / ELF

ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation

C++ 3,388 567 Updated Jun 21, 2019

horovod / horovod

Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.

Python 14,408 2,254 Updated Feb 1, 2025

openai / spinningup

An educational resource to help anyone learn deep reinforcement learning.

Python 10,615 2,289 Updated Aug 5, 2024

BeyondGodlikeBot / CreepBlockAI

Dota 2 Addon for Creep Block Episodic Reinforcement Learning

Lua 36 6 Updated Aug 24, 2017

openai / baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Python 16,082 4,891 Updated Aug 1, 2024

Nostrademous / Dota2-FullOverwrite

Work in progress for a full-overwrite Dota 2 bot framework

Lua 98 39 Updated Jun 12, 2017

TimZaman / dotaclient

distributed RL spaghetti al arabiata

Python 28 7 Updated Mar 29, 2019

2aius / d2ai

Dota 2 API for machine learning

C++ 22 Updated Dec 13, 2018

pydota2 / pydota2_archive

Dota 2 Python AI

Python 99 9 Updated Dec 4, 2018

TimZaman / dotaservice

DotaService is a service to play Dota 2 through gRPC

Lua 119 19 Updated Feb 18, 2024

pydota2 / pydota2

PyDota2 Framework Integrated with DotaService

Python 25 5 Updated Jan 17, 2019

bilibili / LastOrder

StarCraft AI bot

C++ 63 21 Updated Dec 4, 2018

HearthSim / SabberStone

Just another Hearthstone Simulator in C# .Net Core, with some A.I. approaches!

C# 256 100 Updated Dec 8, 2022

jaromiru / AI-blog

Accompanying repository for Let's make a DQN / A3C series.

Python 394 172 Updated Sep 4, 2018

TorchCraft / TorchCraft

Connecting Torch to StarCraft

C++ 1,388 210 Updated Aug 30, 2021

TorchCraft / StarData

Starcraft AI Research Dataset

Python 574 75 Updated Aug 30, 2021