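A project for training a large language model from scratch, covering pretraining, fine-tuning, and direct preference optimization (DPO). The model has 1B parameters and supports Chinese and English.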

Python 396 53 Updated Feb 18, 2025

LLM finetuning for Sudoku solving

Python 4 Updated May 21, 2025

New word discovery based on word frequency, cohesion coefficient, and left/right adjacency information entropy.

Python 122 23 Updated Mar 14, 2020

📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, Parallelism, MLA, etc.

Python 4,027 277 Updated May 18, 2025

[NeurIPS'24 Spotlight, ICLR'25, ICML'25] Speeds up long-context LLM inference by computing attention approximately and with dynamic sparsity, which reduces inference latency by up to 10x for pre-filli…

Python 1,031 53 Updated May 21, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 14,572 1,820 Updated May 23, 2025

[ICML'2024] Can AI Assistants Know What They Don't Know?

Python 81 9 Updated Feb 5, 2024

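MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. MNBVC covers not only mainstream culture but also niche subcultures, and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.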

3,866 274 Updated Apr 13, 2025

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 11,506 888 Updated Mar 11, 2025

Curated tutorials and resources for Large Language Models, Text2SQL, Text2DSL, Text2API, Text2Vis and more.

2,757 196 Updated Mar 21, 2025

NL2SQL competition dataset

202 47 Updated Jul 19, 2023

Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud.

Python 11,277 722 Updated May 19, 2025

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 34,851 3,220 Updated May 23, 2025

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 3,998 309 Updated Apr 16, 2025

Code and models for ICML 2024 paper, NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,499 352 Updated May 13, 2025

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 107,981 17,577 Updated May 22, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 6,391 548 Updated May 21, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 49,448 6,021 Updated May 21, 2025

A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5,688 504 Updated Jul 18, 2024

A (somewhat) minimal library for finetuning language models with PPO on human feedback.

Python 85 17 Updated Nov 23, 2022

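Llama Chinese community: a real-time roundup of the latest Llama learning materials, building the best open-source ecosystem for Chinese Llama large models. Fully open source and commercially usable.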

Python 14,593 1,304 Updated Apr 6, 2025

Apply RLHF directly to ChatGLM to raise or lower the probability of target outputs | Modify ChatGLM output with only RLHF

Python 193 27 Updated May 23, 2023

A programmer's guide to living longer

31,874 2,211 Updated May 19, 2025

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 18,491 1,873 Updated May 21, 2025

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python 41,054 5,219 Updated Jun 27, 2024

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese dialogue large language model)

HTML 8,157 767 Updated Oct 16, 2024
JavaScript 3,480 685 Updated May 22, 2025

Conversational Recommender System (CRS) paper list.

147 25 Updated Nov 24, 2022

CRSLab is an open-source toolkit for building Conversational Recommender System (CRS).

Python 531 115 Updated Apr 12, 2024

User-Centric Conversational Recommendation with Multi-Aspect User Modeling (UCCR)

Python 39 7 Updated Jul 7, 2022