holarissun

Follow

🎯

Focusing

Hao Sun holarissun

🎯

Focusing

Follow

PhD in Reinforcement Learning, LLM Alignment, RLHF

93 followers · 36 following

University of Cambridge
https://holarissun.github.io/
@HolarisSun

Achievements

Achievements

Highlights

Pro

Pinned Loading

RewardModelingBeyondBradleyTerry Public

official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and Alternatives

Python 55 3
RewardShifting Public

Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL

Python 29 3
Prompt-OIRL Public

code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning

Python 40 6
embedding-based-llm-alignment Public

Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs

Python 12 1
Inverse-RLignment Public

inverse reinforcement learning for LLM alignment

478 contributions in the last year

Learn how we count contributions

Less

More

Activity overview

Contributed to holarissun/RewardModelingBeyondBradleyTerry, holarissun/HandsOnTransformers, holarissun/embedding-based-llm-alignment and 3 other repositories

Contribution activity

April 2025

Created 16 commits in 4 repositories

Created 1 repository

holarissun/Inverse-RLignment
This contribution was made on Apr 15

2 contributions in private repositories Apr 15