longzw1997

ZHUI longzw1997

8 followers · 2 following

Achievements

Stars

VITA-MLLM / Freeze-Omni

✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

Python 286 19 Updated Jan 2, 2025

baaivision / Emu3

Next-Token Prediction is All You Need

Python 2,028 78 Updated Oct 24, 2024

showlab / Show-o

[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,251 55 Updated Mar 12, 2025

showlab / Awesome-Unified-Multimodal-Models

📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.

397 16 Updated Jan 18, 2025

ChenHsing / Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

2,016 104 Updated Dec 9, 2024

VITA-MLLM / VITA

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,150 164 Updated Feb 13, 2025

BradyFU / Video-MME

✨✨[CVPR 2025] Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

480 20 Updated Dec 14, 2024

zhourax / VEGA

Python 34 2 Updated Jul 9, 2024

rom1504 / clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

Jupyter Notebook 2,506 220 Updated Apr 15, 2024

BradyFU / Woodpecker

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

Python 632 31 Updated Dec 23, 2024

shenyunhang / APE

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Python 552 42 Updated May 8, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

14,227 918 Updated Mar 5, 2025

LuckyyySTA / Awesome-LLM-hallucination

LLM hallucination paper list

309 22 Updated Mar 11, 2024

longzw1997 / Open-GroundingDino

This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.

Python 546 98 Updated Jun 25, 2024

BIGBALLON / distribuuuu

The pure and clear PyTorch Distributed Training Framework.

Python 276 56 Updated Jan 24, 2024

DafaRen / visual-pushing-grasping

Forked from andyzeng/visual-pushing-grasping

Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.

Python 1 Updated Feb 4, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ZHUI longzw1997

Achievements

Achievements

Block or report longzw1997

Stars

VITA-MLLM / Freeze-Omni

baaivision / Emu3

showlab / Show-o

showlab / Awesome-Unified-Multimodal-Models

ChenHsing / Awesome-Video-Diffusion-Models

VITA-MLLM / VITA

BradyFU / Video-MME

zhourax / VEGA

rom1504 / clip-retrieval

BradyFU / Woodpecker

shenyunhang / APE

BradyFU / Awesome-Multimodal-Large-Language-Models

LuckyyySTA / Awesome-LLM-hallucination

longzw1997 / Open-GroundingDino

BIGBALLON / distribuuuu

DafaRen / visual-pushing-grasping