Skip to content
View guanlongtianzi's full-sized avatar

Block or report guanlongtianzi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 32,811 2,186 Updated Feb 28, 2025

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 605 42 Updated Feb 28, 2025

Exploring Applications of GRPO

Python 102 9 Updated Feb 16, 2025

Align Anything: Training All-modality Model with Feedback

Python 2,443 337 Updated Feb 28, 2025

FACTUAL benchmark dataset, the pre-trained textual scene graph parser trained on FACTUAL.

Python 102 12 Updated Feb 9, 2025

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,145 213 Updated Feb 28, 2025

⛽️「算法通关手册」:超详细的「算法与数据结构」基础讲解教程,从零基础开始学习算法知识,850+ 道「LeetCode 题目」详细解析,200 道「大厂面试热门题目」。

Python 6,494 1,168 Updated Jan 16, 2025

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 5,340 1,181 Updated Feb 15, 2025

this project will bootstrap and scaffold the projects for specific semantic search and RAG applications along with regular boiler plate code.

Python 88 12 Updated Dec 16, 2024

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,037 72 Updated Feb 19, 2025

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 833 57 Updated Feb 16, 2025

Reference implementation for DPO (Direct Preference Optimization)

Python 2,405 201 Updated Aug 11, 2024

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 809 48 Updated Feb 11, 2025

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 12,630 1,818 Updated Jan 2, 2025
Jupyter Notebook 338 95 Updated Apr 29, 2024

Tools for merging pretrained large language models.

Python 5,323 502 Updated Feb 28, 2025

A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.

Python 248 15 Updated Feb 18, 2025

Official code for ICML 2024 paper on Persona In-Context Learning (PICLe)

Python 23 1 Updated Jun 27, 2024

Embedding Vector Oriented Clustering

Python 132 6 Updated Feb 28, 2025

Collection of Basic Prompt Templates for Various Chat LLMs (Chat LLM 的基础提示模板集合)

37 7 Updated Oct 22, 2024

Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.

Python 273 11 Updated Dec 31, 2024

Agno is a lightweight library for building multi-modal Agents

Python 19,586 2,631 Updated Feb 28, 2025

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,810 499 Updated Sep 25, 2024

《动手学大模型Dive into LLMs》系列编程实践教程

4,460 401 Updated Sep 20, 2024

中文汉语拼音辞典,汉字拼音字典,词典,成语词典,常用字、多音字字典数据库

546 128 Updated Feb 4, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 42,496 5,190 Updated Feb 28, 2025

Official inference library for Mistral models

Jupyter Notebook 10,024 896 Updated Nov 12, 2024

Simple and readable code for training and sampling from diffusion models

Python 248 20 Updated Jan 9, 2025
Next