hrtang22

H.R. Tang hrtang22

Ph.D student in Peking University

Starred repositories

littlespray / VE-Bench

[AAAI 25] Official Implementation for ”E-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment“

Python 32 1 Updated Dec 31, 2024

Tencent / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 7,108 537 Updated Jan 2, 2025

PhysGame / PhysGame

PhysGame Benchmark for Physical Commonsense Evaluation in Gameplay Videos

Python 35 1 Updated Dec 13, 2024

farewellthree / PPLLaVA

Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"

Python 117 5 Updated Nov 19, 2024

hrtang22 / MUSE

Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval"

Python 15 Updated Sep 8, 2024

TencentARC / ST-LLM

[ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"

Python 133 4 Updated Sep 10, 2024

beichenzbc / Long-CLIP

[ECCV 2024] official code for "Long-CLIP: Unlocking the Long-Text Capability of CLIP"

Python 721 36 Updated Aug 13, 2024

PKU-YuanGroup / Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,850 1,039 Updated Dec 31, 2024

NVlabs / ConvSSM

Python 63 3 Updated Oct 22, 2024

farewellthree / STAN

Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"

Python 99 3 Updated Jan 28, 2024

Visual-AI / FROSTER

The official repository for ICLR2024 paper "FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition"

Python 66 5 Updated Dec 3, 2024

XLearning-SCU / 2024-ICLR-Norton

Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]

Python 111 8 Updated Apr 18, 2024

hustvl / Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,135 209 Updated Nov 22, 2024

yunlong10 / Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,780 90 Updated Dec 12, 2024

zhenxingjian / Partial_Distance_Correlation

This is the official GitHub for paper: On the Versatile Uses of Partial Distance Correlation in Deep Learning, in ECCV 2022

Jupyter Notebook 172 16 Updated Jun 4, 2023

foolwood / DRL

[arXiv22] Disentangled Representation Learning for Text-Video Retrieval

Python 93 5 Updated Apr 7, 2022

NExT-GPT / NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Python 3,377 344 Updated Nov 3, 2024

optas / changeit3d

Official pytorch code for "ShapeTalk: A Language Dataset and Framework for 3D Shape Edits and Deformations"

Jupyter Notebook 58 4 Updated Aug 23, 2023

SihengLi99 / TextBind

[2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wildrounded Conversation

Python 47 2 Updated Sep 19, 2023

Yushi-Hu / tifa

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

Python 141 9 Updated Apr 29, 2024

thu-ml / unidiffuser

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Python 1,392 87 Updated May 31, 2023

microsoft / i-Code

Jupyter Notebook 1,682 163 Updated Sep 27, 2024

OpenGVLab / LLaMA-Adapter

[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters

Python 5,791 377 Updated Mar 14, 2024

ZiyuGuo99 / Point-Bind_Point-LLM

Align 3D Point Cloud with Multi-modalities for Large Language Models

Python 427 31 Updated Dec 9, 2023

salesforce / GlueGen

Python 62 3 Updated Oct 22, 2023

jpthu17 / EMCL

[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations

Python 125 9 Updated Apr 9, 2024

RUCAIBox / LLMSurvey

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 10,725 834 Updated Aug 20, 2024

rese1f / MovieChat

[CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

Python 566 42 Updated Dec 18, 2024

impiga / Plain-DETR

[ICCV2023] DETR Doesn’t Need Multi-Scale or Locality Design

Python 193 4 Updated Nov 14, 2023

llm-attacks / llm-attacks

Universal and Transferable Attacks on Aligned Language Models

Python 3,553 484 Updated Aug 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly