RLHF-V

21 followers · 0 following

Stars

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

1,117 stars · 47 forks · Updated Apr 10, 2025

RL4VLM / RL4VLM

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Jupyter Notebook · 337 stars · 23 forks · Updated Dec 15, 2024

OpenBMB / Eurus

Python · 314 stars · 14 forks · Updated Sep 18, 2024

LeapLabTHU / MLLA

Official repository of MLLA (NeurIPS 2024)

Python · 309 stars · 16 forks · Updated Nov 25, 2024

thunlp / Muffin

Python · 63 stars · 3 forks · Updated Feb 5, 2024

RLHF-V / RLAIF-V

[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness

Python · 350 stars · 14 forks · Updated Mar 4, 2025

OpenBMB / MiniCPM-o

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python · 19,236 stars · 1,389 forks · Updated Mar 3, 2025

flyskywhy / react-native-font-sim

React Native fonts SimSun (宋体), SimHei (黑体), and KaiTi (楷体); supports both iOS and Android.

7 stars · 2 forks · Updated Mar 15, 2023

dtde / simhei

1 Updated Apr 15, 2022

thuservices / thuservices

https://thu.services

JavaScript · 402 stars · 58 forks · Updated Apr 5, 2025

RLHF-V / RLHF-V

[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Python · 276 stars · 8 forks · Updated Sep 11, 2024
