xiexinran

2 followers · 1 following

Stars

datawhalechina / self-llm

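"A Guide to Using Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large language models (MLLMs) in a Linux environment.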

Jupyter Notebook 14,040 1,609 Updated Mar 22, 2025

RuilinXu / GovDoc-CN

A Multi-Modal Dataset of Chinese Governmental Documents

31 5 Updated Dec 8, 2020

ZhiningLiu1998 / SelfElicit

SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence!

Jupyter Notebook 8 Updated Feb 17, 2025

bbuing9 / ICLR24_SuRe

Official Code for the paper "SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs" (ICLR 2024)

Python 23 Updated May 7, 2024

ZJU-LLMs / Foundations-of-LLMs

9,222 789 Updated Jan 14, 2025

Mooler0410 / LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

9,784 762 Updated May 31, 2024

ielab / llm-rankers

Document Ranking with Large Language Models.

Python 126 15 Updated Mar 20, 2025

gabriben / awesome-generative-information-retrieval

664 49 Updated Oct 15, 2024

texttron / hyde

HyDE: Precise Zero-Shot Dense Retrieval without Relevance Labels

Jupyter Notebook 520 38 Updated Dec 6, 2024

RUC-NLPIR / LLM4IR-Survey

This is the repo for the survey of LLM4IR.

472 37 Updated Sep 5, 2024

irlab-sdu / fuzi.mingcha

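Fuzi-Mingcha (夫子•明察) is a Chinese judicial large language model jointly developed by Shandong University, Inspur Cloud, and China University of Political Science and Law. It uses ChatGLM as its base model and is trained on large-scale unsupervised Chinese judicial corpora and supervised judicial fine-tuning data. The model supports statute retrieval, case analysis, syllogistic judgment reasoning, and judicial dialogue, and aims to provide comprehensive, highly accurate legal consultation and answers.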

Python 316 24 Updated Oct 25, 2024

jeinlee1991 / chinese-llm-benchmark

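Currently covers 203 large models, including commercial models such as chatgpt, gpt-4o, o3-mini, Google gemini, Claude3.5, Zhipu GLM-Zero, ERNIE Bot (文心一言), qwen-max, Baichuan, iFLYTEK Spark, SenseTime senseChat, and minimax, as well as DeepSeek-R1, qwq-32b, deepseek-v3, qwen2.5, llama3.3, phi-4, glm4, gemma3, mistral, 书生in…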

3,832 166 Updated Mar 24, 2025

baichuan-inc / Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

Python 4,130 299 Updated Nov 8, 2024

shibing624 / MedicalGPT

MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing continued pretraining (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.

Python 3,742 548 Updated Mar 20, 2025

tloen / alpaca-lora

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,859 2,227 Updated Jul 29, 2024

liguodongiot / llm-action

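This project shares the technical principles behind large models along with hands-on experience (large-model engineering and deploying large-model applications).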

HTML 15,638 1,820 Updated Mar 2, 2025

FlagOpen / FlagInstruct

172 3 Updated Apr 20, 2023

chatchat-space / Langchain-Chatchat

Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications based on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

TypeScript 34,335 5,816 Updated Nov 29, 2024

21335732529sky / negative_supervision

The implementation of Text Classification with Negative Supervision (ACL, 2020)

Python 9 4 Updated Oct 8, 2020

CodeAsPoetry / PublicOpinion

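A public opinion analysis system, including web crawling, data cleaning, text summarization, topic classification, sentiment orientation recognition, and visualization of the analysis results.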

Python 383 65 Updated Jul 16, 2022

iamshuaidi / CS-Book

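A collection of commonly used computer science e-books with download links, covering Java, Python, Linux, Go, C, C++, data structures and algorithms, artificial intelligence, computer fundamentals, interviews, design patterns, databases, front-end development, and more.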

11,386 2,706 Updated Jul 28, 2023

kangjianwei / Data-Structure

Textbook source code and exercise solutions for "Data Structures" (《数据结构》) by Yan Weimin and Wu Weimin.

C 3,669 988 Updated Jul 20, 2022

thunlp / CLAIM

79 49 Updated Jun 29, 2020

myx666 / LeCaRD

A Chinese legal case retrieval dataset.

Python 135 19 Updated Jan 2, 2024

beir-cellar / beir

A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.

Python 1,747 201 Updated Feb 25, 2025

stanford-futuredata / ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,283 412 Updated Nov 18, 2024

lekhang4497 / qasper-retriever-reader

Python 1 Updated Apr 28, 2022

Yunfan-Li / Contrastive-Clustering

Code for the paper "Contrastive Clustering" (AAAI 2021)

Python 310 94 Updated Jul 11, 2022

princeton-nlp / LM-BFF

[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723

Python 727 132 Updated Aug 29, 2022

YujiaBao / Distributional-Signatures

"Few-shot Text Classification with Distributional Signatures" ICLR 2020

Python 257 54 Updated Dec 17, 2020