guanlongtianzi

guanlongtianzi

3 followers · 7 following

Stars

unslothai / unsloth

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 32,811 2,186 Updated Feb 28, 2025

lsdefine / simple_GRPO

A very simple GRPO implement for reproducing r1-like LLM thinking.

Python 605 42 Updated Feb 28, 2025

brendanhogan / DeepSeekRL-Extended

Exploring Applications of GRPO

Python 102 9 Updated Feb 16, 2025

PKU-Alignment / align-anything

Align Anything: Training All-modality Model with Feedback

Python 2,443 337 Updated Feb 28, 2025

zhuang-li / FactualSceneGraph

FACTUAL benchmark dataset, the pre-trained textual scene graph parser trained on FACTUAL.

Python 102 12 Updated Feb 9, 2025

EvolvingLMMs-Lab / lmms-eval

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,145 213 Updated Feb 28, 2025

itcharge / LeetCode-Py

⛽️「算法通关手册」：超详细的「算法与数据结构」基础讲解教程，从零基础开始学习算法知识，850+ 道「LeetCode 题目」详细解析，200 道「大厂面试热门题目」。

Python 6,494 1,168 Updated Jan 16, 2025

trotsky1997 / MathBlackBox

Python 1,006 102 Updated Dec 17, 2024

HandsOnLLM / Hands-On-Large-Language-Models

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 5,340 1,181 Updated Feb 15, 2025

pavanjava / bootstrap-rag

this project will bootstrap and scaffold the projects for specific semantic search and RAG applications along with regular boiler plate code.

Python 88 12 Updated Dec 16, 2024

tencent-ailab / persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"

Python 1,037 72 Updated Feb 19, 2025

princeton-nlp / SimPO

[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 833 57 Updated Feb 16, 2025

eric-mitchell / direct-preference-optimization

Reference implementation for DPO (Direct Preference Optimization)

Python 2,405 201 Updated Aug 11, 2024

ContextualAI / HALOs

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 809 48 Updated Feb 11, 2025

chenzomi12 / AISystem

AISystem 主要是指AI系统，包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 12,630 1,818 Updated Jan 2, 2025

waylandzhang / Transformer-from-scratch

Jupyter Notebook 338 95 Updated Apr 29, 2024

arcee-ai / mergekit

Tools for merging pretrained large language models.

Python 5,323 502 Updated Feb 28, 2025

isaacus-dev / semchunk

A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.

Python 248 15 Updated Feb 18, 2025

deeplearning-wisc / picle

Official code for ICML 2024 paper on Persona In-Context Learning (PICLe)

Python 23 1 Updated Jun 27, 2024

TutteInstitute / evoc

Embedding Vector Oriented Clustering

Python 132 6 Updated Feb 28, 2025

Ki-Seki / chat_prompt_templates

Collection of Basic Prompt Templates for Various Chat LLMs (Chat LLM 的基础提示模板集合)

37 7 Updated Oct 22, 2024

fzliu / radient

Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.

Python 273 11 Updated Dec 31, 2024

agno-agi / agno

Agno is a lightweight library for building multi-modal Agents

Python 19,586 2,631 Updated Feb 28, 2025

deepseek-ai / DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

4,810 499 Updated Sep 25, 2024

Lordog / dive-into-llms

《动手学大模型Dive into LLMs》系列编程实践教程

4,460 401 Updated Sep 20, 2024

mapull / chinese-dictionary

中文汉语拼音辞典，汉字拼音字典，词典，成语词典，常用字、多音字字典数据库

546 128 Updated Feb 4, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 42,496 5,190 Updated Feb 28, 2025

UnicomAI / Unichat-llama3-Chinese

Python 351 38 Updated Jul 27, 2024

mistralai / mistral-inference

Official inference library for Mistral models

Jupyter Notebook 10,024 896 Updated Nov 12, 2024

yuanchenyang / smalldiffusion

Simple and readable code for training and sampling from diffusion models

Python 248 20 Updated Jan 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly