- Lead of NJU-MiG (Multimodal Intelligence Group, Nanjing University), VITA, MME, and Awesome-MLLM
- https://bradyfu.github.io/
Stars
MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models
Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation
This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"
This is the official implementation of our paper "QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension"
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
The Next Step Forward in Multimodal LLM Alignment
MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency
LUCY: Linguistic Understanding and Control Yielding Early Stage of Her
✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy
Repo for paper "T2Vid: Translating Long Text into Multi-Image is the Catalyst for Video-LLMs"
✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs
✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024
✨✨Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models
Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future
Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)
✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
VMamba: Visual State Space Models, code is based on mamba
[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering
[CVPR 2024] GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models