Stars
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
A very simple GRPO implement for reproducing r1-like LLM thinking.
Align Anything: Training All-modality Model with Feedback
FACTUAL benchmark dataset, the pre-trained textual scene graph parser trained on FACTUAL.
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
⛽️「算法通关手册」:超详细的「算法与数据结构」基础讲解教程,从零基础开始学习算法知识,850+ 道「LeetCode 题目」详细解析,200 道「大厂面试热门题目」。
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
this project will bootstrap and scaffold the projects for specific semantic search and RAG applications along with regular boiler plate code.
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
Reference implementation for DPO (Direct Preference Optimization)
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
Tools for merging pretrained large language models.
A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.
Official code for ICML 2024 paper on Persona In-Context Learning (PICLe)
Collection of Basic Prompt Templates for Various Chat LLMs (Chat LLM 的基础提示模板集合)
Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.
Agno is a lightweight library for building multi-modal Agents
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Official inference library for Mistral models
Simple and readable code for training and sampling from diffusion models