Stars
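A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are inexpensive to train, including base models, vertical-domain fine-tunes and applications, datasets, and tutorials.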
A high-throughput and memory-efficient inference and serving engine for LLMs
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Firefly: a training tool for large language models, supporting Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other models.
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, and other large language models.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Stable Diffusion web UI
Code for the paper "Evaluating Large Language Models Trained on Code"
Must-read Papers on Large Language Model (LLM) Planning.
A curated collection of ChatGPT prompts for making better use of ChatGPT and other LLM tools.
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
Plug-and-play implementation of "Tree of Thoughts: Deliberate Problem Solving with Large Language Models" that elevates model reasoning by at least 70%
State-of-the-Art Text Embeddings
Code and documentation to train Stanford's Alpaca models, and generate the data.