Stars
This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension"
[ICLR 2025] CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMs
Official implementation for "Seagull: No-reference Image Quality Assessment for Regions of Interest via Visual-Language Instruction Tuning"
Repo for paper "T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs"
Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.
[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
Official implementation of paper 'Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models'.
[arXiv] PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling
[ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models
[ICLR 2025] Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
pkunlp-icler / MIC
Forked from HaozheZhao/MIC
MMICL, a state-of-the-art VLM with in-context learning (ICL) ability, PKU
Code for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.
[ICLR 2025] MLLM for On-Demand Spatial-Temporal Understanding at Arbitrary Resolution
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
PyTorch Implementation of "Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models"
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".