TranSirius
  • Tsinghua University
  • Beijing


Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Python 34 1 Updated Mar 1, 2025

SpargeAttention: A training-free sparse attention that can accelerate any model inference.

Cuda 171 3 Updated Feb 28, 2025

An intranet penetration (NAT traversal) tool implemented with Python and WebSocket. Expose your local services to the internet.

Python 148 38 Updated Dec 24, 2024

Researchers have made remarkable and groundbreaking achievements in exploring the mechanisms and the fundamental nature of intelligence in AI models, particularly LLMs. This paper repository aims t…

8 Updated Feb 17, 2025
Python 11 Updated Jan 30, 2025

[ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style

Python 19 Updated Feb 14, 2025

Machine Learning Engineering Open Book

Python 12,991 792 Updated Mar 1, 2025

awesome papers in LLM interpretability

405 12 Updated Jan 14, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,517 364 Updated Feb 26, 2025

Tools for building GPU clusters

Shell 1,299 335 Updated Jan 16, 2025

Instructions for setting up a Slurm gpu cluster on Ubuntu 22.04.

Shell 20 2 Updated Feb 29, 2024

Material for gpu-mode lectures

Jupyter Notebook 3,850 391 Updated Feb 9, 2025

👩🏿‍💻👨🏾‍💻👩🏼‍💻👨🏽‍💻👩🏻‍💻 A list of projects by Chinese independent developers -- sharing what everyone is working on

38,640 3,203 Updated Feb 28, 2025

[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and its Applications

680 38 Updated Feb 23, 2025

A simple and efficient Mamba implementation in pure PyTorch and MLX.

Python 1,135 100 Updated Dec 4, 2024

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Python 2,731 200 Updated Mar 8, 2024

Notes on Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)

161 12 Updated Jan 7, 2024
Python 318 16 Updated Jul 16, 2024

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Python 2,407 122 Updated Mar 13, 2024

A list of totally open alternatives to ChatGPT

4,592 201 Updated May 3, 2023

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,693 251 Updated Dec 12, 2023

A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)

1,111 58 Updated Jan 4, 2024
Python 59 2 Updated Aug 1, 2023

Resource, Evaluation and Detection Papers for ChatGPT

457 25 Updated Mar 21, 2024

The data and source code for the paper "MoocRadar: A Fine-grained and Multi-aspect Knowledge Repository for Improving Cognitive Student Modeling in MOOCs"

Python 37 2 Updated Oct 7, 2024

A simple Python implementation of the n-gram sunburst (nested pie chart) visualization shown in the CoQA paper

Python 13 4 Updated Mar 12, 2019

A comprehensive, unified and modular event extraction toolkit.

Python 369 36 Updated Dec 18, 2024

Do Pre-trained Models Benefit Knowledge Graph Completion? A Reliable Evaluation and a Reasonable Approach

Python 47 4 Updated Jul 5, 2022

A programmer's guide to living longer

30,990 2,156 Updated Jan 30, 2024

A reading list for papers on causality for natural language processing (NLP)

616 64 Updated Dec 14, 2024