Battam1111

Follow

🎯

Focusing

battam Battam1111

🎯

Focusing

Follow

PhD student @ PolyU & EIT

13 followers · 17 following

Hong Kong Polytechnic University
Hong Kong
06:32 (UTC +08:00)
https://battam1111.github.io/

Organizations

battam1111/README.md

Hi there, I'm Yanjun Chen (陈彦筠) 👋

🧠 INTJ | 🤖 RLHF & Embodied AI researcher
🎓 PhD @ Hong Kong Polytechnic University & EIT | 🇭🇰 Based in Hong Kong

🧬 About Me

🇨🇳 From China, currently living in 🇭🇰 Hong Kong
🏫 PhD student at HK PolyU, collaborating with EIT
🔬 Focused on Reinforcement Learning with Human Feedback (RLHF) and Embodied AI

🧠 Personality & Values

🧭 MBTI: INTJ — Strategic Architect
📐 I thrive on clarity, structure, and deep reasoning
💡 Always seeking elegance over brute-force

🛠️ Skills & Tools

Programming	Languages	AI Fields
Python 🐍, C/C++ ⚙️	Chinese 🇨🇳, English 🇬🇧, Japanese 🇯🇵	RL, RLHF, Embodied AI, LLMs

💼 Projects / Research

You can find my full list of publications, research overviews, and blog posts here:

🔗 Personal Site → https://battam1111.github.io

✨ Fun Facts

🏓 Table tennis player
🎮 Gamer at heart
🎤 KTV Enthusiast
🔍 Perpetual learner (especially in science & philosophy)

📬 Contact Me

📧 Email: [email protected]
💬 WeChat: xzqm13143609845
🧠 Google Scholar: My Publications

Pinned Loading

AccuracyParadox-RLHF AccuracyParadox-RLHF Public

[EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language Models".

Python 8
YJ-SACR YJ-SACR Public

Jupyter Notebook 1
MCTSV MCTSV Public

Python 3
DeepSC-Implement DeepSC-Implement Public

Forked from 13274086/DeepSC

Pytorch implementation of the DeepSC

Python