Skip to content
View Battam1111's full-sized avatar
๐ŸŽฏ
Focusing
๐ŸŽฏ
Focusing

Organizations

@polyunlp

Block or report Battam1111

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
battam1111/README.md

Hi there, I'm Yanjun Chen (้™ˆๅฝฆ็ญ ) ๐Ÿ‘‹

๐Ÿง  INTJ | ๐Ÿค– RLHF & Embodied AI researcher
๐ŸŽ“ PhD @ Hong Kong Polytechnic University & EIT | ๐Ÿ‡ญ๐Ÿ‡ฐ Based in Hong Kong

Website GitHub Email


๐Ÿงฌ About Me

  • ๐Ÿ‡จ๐Ÿ‡ณ From China, currently living in ๐Ÿ‡ญ๐Ÿ‡ฐ Hong Kong
  • ๐Ÿซ PhD student at HK PolyU, collaborating with EIT
  • ๐Ÿ”ฌ Focused on Reinforcement Learning with Human Feedback (RLHF) and Embodied AI

๐Ÿง  Personality & Values

  • ๐Ÿงญ MBTI: INTJ โ€” Strategic Architect
  • ๐Ÿ“ I thrive on clarity, structure, and deep reasoning
  • ๐Ÿ’ก Always seeking elegance over brute-force

๐Ÿ› ๏ธ Skills & Tools

Programming Languages AI Fields
Python ๐Ÿ, C/C++ โš™๏ธ Chinese ๐Ÿ‡จ๐Ÿ‡ณ, English ๐Ÿ‡ฌ๐Ÿ‡ง, Japanese ๐Ÿ‡ฏ๐Ÿ‡ต RL, RLHF, Embodied AI, LLMs

๐Ÿ’ผ Projects / Research

You can find my full list of publications, research overviews, and blog posts here:

๐Ÿ”— Personal Site โ†’ https://battam1111.github.io


โœจ Fun Facts

  • ๐Ÿ“ Table tennis player
  • ๐ŸŽฎ Gamer at heart
  • ๐ŸŽค KTV Enthusiast
  • ๐Ÿ” Perpetual learner (especially in science & philosophy)

๐Ÿ“ฌ Contact Me


Pinned Loading

  1. AccuracyParadox-RLHF AccuracyParadox-RLHF Public

    [EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language Models".

    Python 8

  2. YJ-SACR YJ-SACR Public

    Jupyter Notebook 1

  3. MCTSV MCTSV Public

    Python 3

  4. DeepSC-Implement DeepSC-Implement Public

    Forked from 13274086/DeepSC

    Pytorch implementation of the DeepSC

    Python