Skip to content
View hrtang22's full-sized avatar
  • PKU

Block or report hrtang22

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[AAAI 25] Official Implementation for ”E-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment“

Python 32 1 Updated Dec 31, 2024

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 7,108 537 Updated Jan 2, 2025

PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos

Python 35 1 Updated Dec 13, 2024

Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"

Python 117 5 Updated Nov 19, 2024

Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval"

Python 15 Updated Sep 8, 2024

[ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"

Python 133 4 Updated Sep 10, 2024

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

Python 721 36 Updated Aug 13, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,850 1,039 Updated Dec 31, 2024
Python 63 3 Updated Oct 22, 2024

Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"

Python 99 3 Updated Jan 28, 2024

The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"

Python 66 5 Updated Dec 3, 2024

Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]

Python 111 8 Updated Apr 18, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,135 209 Updated Nov 22, 2024

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,780 90 Updated Dec 12, 2024

This is the official GitHub for paper: On the Versatile Uses of Partial Distance Correlation in Deep Learning, in ECCV 2022

Jupyter Notebook 172 16 Updated Jun 4, 2023

[arXiv22] Disentangled Representation Learning for Text-Video Retrieval

Python 93 5 Updated Apr 7, 2022

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,377 344 Updated Nov 3, 2024

Official pytorch code for "ShapeTalk: A Language Dataset and Framework for 3D Shape Edits and Deformations"

Jupyter Notebook 58 4 Updated Aug 23, 2023

[2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wildrounded Conversation

Python 47 2 Updated Sep 19, 2023

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

Python 141 9 Updated Apr 29, 2024

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Python 1,392 87 Updated May 31, 2023
Jupyter Notebook 1,682 163 Updated Sep 27, 2024

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,791 377 Updated Mar 14, 2024

Align 3D Point Cloud with Multi-modalities for Large Language Models

Python 427 31 Updated Dec 9, 2023
Python 62 3 Updated Oct 22, 2023

[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations

Python 125 9 Updated Apr 9, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 10,725 834 Updated Aug 20, 2024

[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

Python 566 42 Updated Dec 18, 2024

[ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design

Python 193 4 Updated Nov 14, 2023

Universal and Transferable Attacks on Aligned Language Models

Python 3,553 484 Updated Aug 2, 2024
Next