An Open-source RL System from ByteDance Seed and Tsinghua AIR

1,117 47 Updated Apr 10, 2025

Official Repo for Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

Jupyter Notebook 337 23 Updated Dec 15, 2024
Python 314 14 Updated Sep 18, 2024

Official repository of MLLA (NeurIPS 2024)

Python 309 16 Updated Nov 25, 2024
Python 63 3 Updated Feb 5, 2024

[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness

Python 350 14 Updated Mar 4, 2025

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 19,236 1,389 Updated Mar 3, 2025

React Native fonts SimSun (宋体), SimHei (黑体), and KaiTi (楷体); supports both iOS and Android.

7 2 Updated Mar 15, 2023
1 Updated Apr 15, 2022

https://thu.services

JavaScript 402 58 Updated Apr 5, 2025

[CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Python 276 8 Updated Sep 11, 2024