Stars
🚀 PR-Agent (Qodo Merge open-source): An AI-Powered 🤖 Tool for Automated Pull Request Analysis, Feedback, Suggestions and More! 💻🔍
Qodo-Cover: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞
🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.
Keeps searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget)
[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style
A generalized information-seeking agent system with Large Language Models (LLMs).
The easiest tool for fine-tuning LLMs, synthetic data generation, and collaborating on datasets.
An open-source recreation of the AgentInstruct agentic workflow for synthetic data generation
Curated list of research papers published in 2024 related to Large Language Models (LLMs)
Real-time updated, fine-grained reading list on LLM-synthetic-data.🔥
A reading list on LLM-based Synthetic Data Generation 🔥
A replication of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data
Synthetic data curation for post-training and structured data extraction
A recipe for online RLHF and online iterative DPO.
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
A bibliography and survey of the papers surrounding o1
The most modern LLM evaluation toolkit
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
Fully open reproduction of DeepSeek-R1
Dataset for the COLING 2025 accepted paper: "A Testset for Context-Aware LLM Translation in Korean-to-English Discourse Level Translation." This dataset features 600 instances covering six linguist…
The official repository of the paper "(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts"
MapCoder: Multi-Agent Code Generation for Competitive Problem Solving
A series of technical reports on Slow Thinking with LLMs
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.