Lists (15)
Sort Name ascending (A-Z)
Stars
Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"
code for paper Hierarchical Retrieval-Augmented Generation Model with Rethink for Multi-hop Question Answering
Awesome-LLM-RAG: a curated list of advanced retrieval augmented generation (RAG) in Large Language Models
基于200万条医疗数据对DeepSeek-R1-Distill-Qwen-32B进行fine tune且部署
Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This repository contains the code for the experiments in the paper.
Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".
SOTA RL fine-tuning solution for advanced math reasoning of LLM
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Robust recipes to align language models with human and AI preferences
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Free ChatGPT API Key,免费ChatGPT API,支持GPT4 API(免费),ChatGPT国内可用免费转发API,直连无需代理。可以搭配ChatBox等软件/插件使用,极大降低接口使用成本。国内即可无限制畅快聊天。
Rethinking Chain-of-Thought from the Perspective of Self-Training
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
A library for advanced large language model reasoning
This repository contains sources about reinforcement learning human feedback for math reasoning,.
[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".
open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality
A Comprehensive Toolkit for High-Quality PDF Content Extraction
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..