Skip to content
View rk2900's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report rk2900

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Paper list for Efficient Reasoning.

289 11 Updated Mar 22, 2025

A curated list of resources for activation engineering

45 1 Updated Mar 11, 2025

A resource repository for representation engineering in large language models

115 6 Updated Nov 14, 2024
Python 909 105 Updated Jan 23, 2025

Code for Quiet-STaR

Python 721 89 Updated Aug 21, 2024
Python 51 4 Updated Oct 30, 2024

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 8,233 782 Updated Mar 20, 2025

Fully open reproduction of DeepSeek-R1

Python 23,162 2,109 Updated Mar 22, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 5,459 526 Updated Mar 22, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 1,197 84 Updated Mar 22, 2025
Python 7 2 Updated Dec 13, 2024

InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)

Python 113 16 Updated Dec 17, 2024

Explorations into some recent techniques surrounding speculative decoding

Python 250 20 Updated Dec 22, 2024

Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models know themselves through automated interpretability.

Python 163 23 Updated Mar 21, 2025
Python 129 15 Updated Dec 17, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 38,167 4,666 Updated Mar 1, 2025

augmented LLM with self reflection

117 6 Updated Nov 21, 2023

OpenMMLab Pose Estimation Toolbox and Benchmark.

Python 6,233 1,320 Updated Aug 7, 2024

Make human motion capture easier.

Python 3,925 472 Updated Feb 26, 2025

Official implementation of DeepLabCut: Markerless pose estimation of user-defined features with deep learning for all animals incl. humans

Python 4,923 1,694 Updated Mar 20, 2025

A repository containing the implementations about our machine learning researches on sequence data and sequential decision making.

Python 104 21 Updated Jan 29, 2024

End-to-end Generative Optimization for AI Agents

Python 521 41 Updated Mar 13, 2025

A probabilitic model for contextual word representation. Accepted to ACL2023 Findings.

Python 23 2 Updated Oct 22, 2023

The machine learning toolkit for time series analysis in Python

Python 2,954 345 Updated Jul 1, 2024

Code repository for TIDMAD: Time series Dataset for Discovering Dark Matter with AI Denoising.

Jupyter Notebook 4 3 Updated Sep 27, 2024

Recipes to train reward model for RLHF.

Python 1,250 91 Updated Feb 9, 2025

The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.

Python 43 6 Updated May 17, 2023

Modeling, training, eval, and inference code for OLMo

Python 5,417 580 Updated Mar 22, 2025

This repository collects all relevant resources about interpretability in LLMs

327 20 Updated Nov 1, 2024

Time-series machine learning at scale. Built with Polars for embarrassingly parallel feature extraction and forecasts on panel data.

Python 1,085 61 Updated Jun 29, 2024
Next