Stars
Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Recipes to train reward model for RLHF.
Simple extension on vLLM to help you speed up reasoning model without training.
A Survey on Large Language Model-Based Game Agents
PantheonRL is a package for training and testing multi-agent reinforcement learning environments. PantheonRL supports cross-play, fine-tuning, ad-hoc coordination, and more.
RAGEN is the first open-source reproduction of DeepSeek-R1 on AGENT training.
This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
📖A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, Flash-Attention, Paged-Attention, Parallelism, etc. 🎉🎉
A reading list on LLM based Synthetic Data Generation 🔥
Empowering RAG with a memory-based data interface for all-purpose applications!
GPT4 based personalized ArXiv paper assistant bot
Implementation of Nougat Neural Optical Understanding for Academic Documents
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…
Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥
[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs
A system for agentic LLM-powered data processing and ETL
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
📘 OpenAPI/Swagger-generated API Reference Documentation
Access models from OpenAI, Groq, local Ollama, and others by setting llm-router as Cursor's Base URL
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
An automated pipeline for evaluating LLMs for role-playing.