Ph.D. student at Princeton University, focusing on LLMs, especially Language Modeling and Pretraining, LLM Reasoning, and Reinforcement Learning.
Homepage: https://yifzhang.com
Official implementation of ACL 2025 Findings paper "Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts" (As Huggingface Daily Papers: https://huggingface.co/pape…
The official implementation of TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)
Official implementation of TMLR paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371)
Official implementation of ICML 2025 paper "Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment" (https://arxiv.org/abs/2410.02197)
The official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)
Official implementation of paper "Meta Prompting for AI Systems" (https://arxiv.org/abs/2311.11482)