reddyav1
  • Johns Hopkins University
  • Baltimore, MD


This is the official implementation of our paper "Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension"

Python 170 19 Updated Feb 23, 2025

This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)

Python 26 6 Updated Jun 28, 2024

Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)

Python 63 1 Updated Jun 7, 2024

Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"

Python 99 3 Updated Jan 28, 2024
Python 29 2 Updated Aug 14, 2023

An official pytorch implementation of the paper: [MV-Adapter: Multimodal Video Transfer Learning for Video Text Retrieval].

Python 13 2 Updated Jul 27, 2024

Source code of our CVPR2024 paper TeachCLIP for Text-to-Video Retrieval

Python 31 1 Updated Mar 3, 2025

Video Summarization With Spatiotemporal Vision Transformer

Python 22 7 Updated Jul 5, 2023
Python 52 2 Updated Jun 4, 2024

A lightweight library to support the development of applications using LLMs

Python 5 Updated Apr 25, 2024
Python 16 Updated Jul 26, 2023

Code release for ActionFormer (ECCV 2022)

Python 480 82 Updated Apr 11, 2024

Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)

Python 40 Updated Apr 15, 2024

Official repo for BMVC2021 paper ASFormer: Transformer for action segmentation

Python 104 19 Updated Feb 19, 2022

End to End Streaming Video Temporal Segmentation

Python 25 6 Updated Mar 10, 2025

Official PyTorch code of GroundVQA (CVPR'24)

Python 59 2 Updated Sep 13, 2024
Python 128 20 Updated Jan 3, 2024

Awesome papers & datasets specifically focused on long-term videos.

267 12 Updated Nov 15, 2024

EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties

Python 122 9 Updated Nov 10, 2024

Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation

480 38 Updated Apr 2, 2025

[CVPR 2022] Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection

Python 50 3 Updated Mar 6, 2023
Python 6 2 Updated Feb 8, 2024

[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale

Jupyter Notebook 190 21 Updated Nov 13, 2023

Official implementation for "A Simple LLM Framework for Long-Range Video Question-Answering"

Python 94 4 Updated Oct 27, 2024

Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"

Python 25 Updated Aug 28, 2023

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Python 358 32 Updated Nov 19, 2024
Python 86 2 Updated Dec 30, 2024

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 3,214 261 Updated Jan 18, 2025