Stars
Papers about red teaming LLMs and Multimodal models.
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with comma…
Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪
Moonshot - A simple and modular tool to evaluate and red-team any LLM application.
[NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"
XRAG: eXamining the Core - Benchmarking Foundational Component Modules in Advanced Retrieval-Augmented Generation
Task-Aware Agent-driven Prompt Optimization Framework
Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas
Code release for Best-of-N Jailbreaking
A modular graph-based Retrieval-Augmented Generation (RAG) system
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-level concepts to analyze unstructured text.
Supercharge Your LLM Application Evaluations 🚀
Controllable Text Generation for Large Language Models: A Survey
搜索、推荐、广告、用增等工业界实践文章收集(来源:知乎、Datafuntalk、技术公众号)
该仓库尝试整理推荐系统领域的一些经典算法模型
An Open Source implementation of Notebook LM with more flexibility and features
RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.
QLExpress is a powerful, lightweight, dynamic language for the Java platform aimed at improving developers’ productivity in different business scenes.
A curated list of resources for using LLMs to develop more competitive grant applications.
RAGChecker: A Fine-grained Framework For Diagnosing RAG
Development repository for the Triton language and compiler
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)