A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…

1,125 57 Updated Jan 19, 2025

zjunlp / WorfBench

Benchmarking Agentic Workflow Generation

Python 36 2 Updated Nov 26, 2024

ThuCCSLab / Awesome-LM-SSP

A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).

1,100 72 Updated Jan 17, 2025

DAMO-NLP-SG / Auto-Arena-LLMs

Jupyter Notebook 36 1 Updated Oct 7, 2024

YangLing0818 / SuperCorrect-llm

SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights

Python 45 1 Updated Oct 14, 2024

Blue-Raincoat / SelectIT

Python 16 2 Updated Oct 14, 2024

2003pro / TAGCOS

This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data

Python 10 Updated Jul 21, 2024

microsoft / rho

Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.

383 11 Updated Apr 18, 2024

OFA-Sys / InsTag

InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning

236 7 Updated Aug 20, 2023

tatsu-lab / alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,610 250 Updated Dec 27, 2024

Instruction-Tuning-with-GPT-4 / GPT-4-LLM

Instruction Tuning with GPT-4

HTML 4,254 301 Updated Jun 11, 2023

nlpxucan / WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,311 725 Updated Aug 5, 2024

jianzhnie / awesome-instruction-datasets

A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。

582 30 Updated Apr 7, 2024

xypan0 / G-DIG

Python 10 Updated Jun 30, 2024

alon-albalak / data-selection-survey

A Survey on Data Selection for Language Models

202 10 Updated Oct 13, 2024

sail-sg / regmix

🧬 RegMix: Data Mixture as Regression for Language Model Pre-training

Jupyter Notebook 97 5 Updated Oct 3, 2024

hkust-nlp / deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 526 29 Updated Dec 9, 2024

WeOpenML / PandaLM

Python 894 67 Updated May 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hunxuewangzi

Block or report hunxuewangzi

Stars

zjunlp / OmniThink

Alibaba-NLP / WebWalker

git-disl / Vaccine

lyt719 / LLM-evaluation-datasets

git-disl / awesome_LLM-harmful-fine-tuning-papers

Unispac / shallow-vs-deep-alignment

wdndev / llm_interview_note

hellotransformers / Natural_Language_Processing_with_Transformers

chunhuizhang / personal_chatgpt

naklecha / llama3-from-scratch

princeton-nlp / benign-data-breaks-safety

llm-attacks / llm-attacks

ydyjya / Awesome-LLM-Safety