Starred repositories
Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models
[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't Know'"
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
A high-throughput and memory-efficient inference and serving engine for LLMs
This is the repo for the survey of Bias and Fairness in IR with LLMs.
A general fine-tuning kit geared toward diffusion models.
m&ms: A Benchmark to Evaluate Tool-Use for multi-step multi-modal tasks
This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
Code for "FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models (ACL 2024)"
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
[ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
The #1 open-source voice interface for desktop, mobile, and ESP32 chips.
LLM based autonomous agent that conducts local and web research on any topic and generates a comprehensive report with citations.
Collection of China illegal cases about web crawler 本项目用来整理所有中国大陆爬虫开发者涉诉与违规相关的新闻、资料与法律法规。致力于帮助在中国大陆工作的爬虫行业从业者了解我国相关法律,避免触碰数据合规红线。 [AD]中文知识图谱门户
A guidance language for controlling large language models.
[ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning
Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".
Enhanced ChatGPT Clone: Features Agents, Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code…
Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.
[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models