

Starred repositories
MAC-SQL: A Multi-Agent Collaborative Framework for Text-to-SQL
Code and data for the paper "DBCᴏᴘɪʟᴏᴛ: Natural Language Querying over Massive Database via Schema Routing" (EDBT 2025)
A resource repository for representation engineering in large language models
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
This project aims to consolidate and share high-quality resources and tools across the cybersecurity domain.
Official PyTorch implementation for "Large Language Diffusion Models"
Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding
Collection of leaked system prompts
A course on aligning smol models.
A fast + lightweight implementation of the GCG algorithm in PyTorch
DeepSeek Coder: Let the Code Write Itself
No fortress, purely open ground. OpenManus is Coming.
An invisible desktop application to help you pass your technical interviews.
Improved techniques for optimization-based jailbreaking on large language models (ICLR2025)
A framework for few-shot evaluation of language models.
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
Robust recipes to align language models with human and AI preferences
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
A set of case studies for the Wintermute Alpha Challenge
[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models