Stars
SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks
[ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"
A series of technical reports on slow thinking with LLMs
Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, code, datasets, evaluations, and analyses.
[ICML 2024] COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability
veRL: Volcano Engine Reinforcement Learning for LLM
A toolkit for describing model features and intervening on those features to steer behavior.
Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
Moonshot - A simple and modular tool to evaluate and red-team any LLM application.
Set of tools to assess and improve LLM security.
The Python Risk Identification Tool for generative AI (PyRIT) is an open source framework built to empower security professionals and engineers to proactively identify risks in generative AI systems.
🐢 Open-Source Evaluation & Testing for AI & LLM systems
Submission Guide + Discussion Board for AI Singapore Global Challenge for Safe and Secure LLMs (Track 2A).
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
Official PyTorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations" (ICLR '25)
✨✨Latest Advances on Multimodal Large Language Models
[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training
SpeechGPT Series: Speech Large Language Models
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
Reproduction code for Qwen-Audio fine-tuning
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
An open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities.