Skip to content
View cyj95's full-sized avatar

Block or report cyj95

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Python 981 64 Updated Mar 19, 2025

[NAACL 2025 Oral] 🎉 From redundancy to relevance: Enhancing explainability in multimodal large language models

Python 87 6 Updated Feb 13, 2025

[ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

Python 385 15 Updated Jan 4, 2025

Improving Mamaba performance on Video Understanding task

Python 38 5 Updated Oct 20, 2024

[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Python 206 18 Updated Mar 3, 2025

Awesome LLM compression research papers and tools.

1,428 91 Updated Mar 20, 2025

MADiff: Motion-Aware Mamba Diffusion Models for Hand Trajectory Prediction on Egocentric Videos

18 1 Updated Sep 6, 2024

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Python 843 47 Updated Mar 20, 2025

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 830 53 Updated Feb 26, 2025

[ECCV2024] Official code implementation of Merlin: Empowering Multimodal LLMs with Foresight Minds

Python 90 Updated Jul 4, 2024

When do we not need larger vision models?

Python 380 12 Updated Feb 8, 2025

[ECCV 2024] Official PyTorch implementation code for realizing the technical part of Mixture of All Intelligence (MoAI) to improve performance of numerous zero-shot vision language tasks.

Python 319 34 Updated Mar 28, 2024

Official implementation of the ICASSP-2022 paper "Text2Poster: Laying Out Stylized Texts on Retrieved Images"

Python 210 17 Updated Dec 18, 2023

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 141,749 28,366 Updated Mar 22, 2025

An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.

Python 4,147 719 Updated Mar 21, 2025

The project page of paper: CPM-Nets: Cross Partial Multi-View Networks [NeurIPS 2019 Spotlight paper]

Python 81 27 Updated May 23, 2022
Showing results