Stars
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
🌐 WebWaker: Benchmarking LLMs in Web Traversal
This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)
A survey on harmful fine-tuning attack for large language model
Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep
Natural Language Processing with Transformers 中译本,最权威Transformers教程
llama3 implementation one matrix multiplication at a time
Universal and Transferable Attacks on Aligned Language Models
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights
This is the official implementation of TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Instruction Tuning with GPT-4
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。
A Survey on Data Selection for Language Models
🧬 RegMix: Data Mixture as Regression for Language Model Pre-training
Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]