Stars
Democratizing Reinforcement Learning for LLMs
Out-of-the-box (OOTB) GUI Agent for Windows and macOS
Code and documentation to train Stanford's Alpaca models, and generate the data.
中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
Redis Cluster Daily Maintenance Tool/Redis集群日常运维工具
A fast single-direction queue for multiprocessing.
ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
An educational resource to help anyone learn deep reinforcement learning.
Dota 2 Addon for Creep Block Episodic Reinforcement Learning
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Work in progress for a full-overwrite Dota 2 bot framework
DotaService is a service to play Dota 2 through gRPC
Just another Hearthstone Simulator in C# .Net Core, with some A.I. approaches!
Accompanying repository for Let's make a DQN / A3C series.