huaxiuyao


MMedPO: Aligning Medical Vision-Language Models with Clinical-Aware Multimodal Preference Optimization

Python 15 Updated Dec 14, 2024

GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization

Python 68 3 Updated Dec 15, 2024

A collection of papers on autoregressive models in vision.

368 13 Updated Jan 16, 2025

Code for paper "CREAM: Consistency Regularized Self-Rewarding Language Models".

7 1 Updated Oct 15, 2024

[arXiv'24 & NeurIPSW'24] MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models

Python 85 8 Updated Dec 17, 2024

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Python 26 3 Updated Nov 3, 2024

[EMNLP'24] RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models

Python 56 3 Updated Dec 13, 2024

Official implementation for "MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?"

Jupyter Notebook 41 5 Updated Nov 19, 2024

[NeurIPS'24 & ICMLW'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models

Python 64 4 Updated Dec 4, 2024

[NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models

Python 61 2 Updated Jun 10, 2024

[arXiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning

Python 78 3 Updated Apr 30, 2024

Official implementation for "MJ-BENCH: Is Your Multimodal Reward Model Really a Good Judge?"

Jupyter Notebook 8 Updated Jun 7, 2024

Multimodal Learning Method MLA for CVPR 2024

Python 71 8 Updated Jun 18, 2024

[ICML 2024] Official implementation for "HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding"

Python 79 1 Updated Dec 4, 2024
Python 5 Updated Oct 6, 2023

[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"

Python 76 3 Updated Nov 28, 2023
Python 3 Updated Dec 23, 2023

[ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models

Python 139 5 Updated Apr 30, 2024
54 1 Updated Apr 1, 2024

Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image …

Python 2,015 265 Updated Jan 19, 2025

C-Mixup for NeurIPS 2022

Python 68 4 Updated Dec 17, 2023

Benchmark for Natural Temporal Distribution Shift (NeurIPS 2022)

Python 65 8 Updated Mar 29, 2023

MetaMix for ICML 2021

Python 27 6 Updated Jun 9, 2021

LISA for ICML 2022

Python 47 9 Updated Apr 12, 2023
JavaScript 5 4 Updated Jul 23, 2022

KGML for EMNLP 2021

Python 10 Updated Feb 2, 2022

MLTI for ICLR 2022

Python 30 5 Updated May 6, 2022

ATS for NeurIPS 2021

Python 21 1 Updated Nov 4, 2021

FRML for NeurIPS 2021

2 Updated Oct 23, 2021

GFL for AAAI 2020

Python 31 4 Updated Jan 27, 2021