-
Tsinghua University
- Beijing
Highlights
- Pro
Stars
Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
SpargeAttention: A training-free sparse attention that can accelerate any model inference.
内网穿透工具 基于Python/WebSocket实现, Expose your local services to the internet.
Researchers have made remarkable and groundbreaking achievements in exploring the mechanisms and the fundamental nature of intelligence in AI models, particularly LLMs. This paper repository aims t…
[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style
Machine Learning Engineering Open Book
awesome papers in LLM interpretability
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Instructions for setting up a Slurm gpu cluster on Ubuntu 22.04.
👩🏿💻👨🏾💻👩🏼💻👨🏽💻👩🏻💻中国独立开发者项目列表 -- 分享大家都在做什么
[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications
A simple and efficient Mamba implementation in pure PyTorch and MLX.
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
A list of totally open alternatives to ChatGPT
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
Resource, Evaluation and Detection Papers for ChatGPT
The data and source code for the paper "MoocRadar: A Fine-grained and Multi-aspect Knowledge Repository for Improving Cognitive Student Modeling in MOOCs"
A simple Python implementation of ngram sunburst (nested pie chart) visualization showed in CoQA paper
A comprehensive, unified and modular event extraction toolkit.
Do Pre-trained Models Benefit Knowledge Graph Completion? A Reliable Evaluation and a Reasonable Approach
程序员延寿指南 | A programmer's guide to live longer
A reading list for papers on causality for natural language processing (NLP)