
- Beijing, China
- haozheji.github.io
- @HaozJi
Stars
- DeepEP: an efficient expert-parallel communication library
- Minimal RLHF implementation built on top of minGPT.
- Create Epic Math and Physics Animations From Text.
- verl: Volcano Engine Reinforcement Learning for LLMs
- Detailed proofs of "Bias Variance Decomposition for KL Divergence".
- Scalable toolkit for efficient model alignment
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
- An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & LoRA & vLLM & RFT)
- Repository for "Generative Flow Networks as Entropy-Regularized RL" (AISTATS 2024, Oral)
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
- An in-browser, local-first Markdown resume builder.
- A PowerPoint add-in to insert LaTeX equations into PowerPoint presentations on Windows and Mac
- Oh my tmux! My self-contained, pretty & versatile tmux configuration made with 💛🩷💙🖤❤️🤍
- Paper list about multimodal and large language models, only used to record papers I read in the daily arXiv for personal needs.
- ICLR 2023 - Tailoring Language Generation Models under Total Variation Distance
- Some preliminary explorations of Mamba's context scaling.
- Easy TOC creation for GitHub README.md
- Example models using DeepSpeed
- A high-throughput and memory-efficient inference and serving engine for LLMs
- Robust recipes to align language models with human and AI preferences
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
- Reference implementation for DPO (Direct Preference Optimization)
- DSPy: The framework for programming—not prompting—language models
- A curated list of reinforcement learning with human feedback resources (continually updated)