Stars
A curated list of foundation models for vision and language tasks
Official source code for "Continual 3D Convolutional Neural Networks for Real-time Processing of Videos" [ECCV2022]
📓 Notes and summaries of various ML, Computer Vision & NLP papers.
Release of the pretrained S3D Network in PyTorch (ECCV 2018)
A curated list of Computer Vision related conferences with dates and paper registration deadlines.
Computer Vision Journals List, Review Speed, Impact Factors
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Compendium of free ML reading resources
Collection of common code shared among different research projects in the FAIR computer vision team.
Multi-layer Recurrent Neural Networks (LSTM, GRU, RNN) for character-level language models in Torch
Reading list for research topics in multimodal machine learning
Compact Bilinear Pooling for PyTorch
A simple PyTorch implementation of GradCAM and GradCAM++
PyTorch implementation of various Knowledge Distillation (KD) methods.
[CVPR2021] UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles
Code for ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
Sequence modeling benchmarks and temporal convolutional networks
A curated list of papers on skeleton-based action recognition.
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…
Code repository for the paper "Learning Discriminative Representations for Skeleton Based Action Recognition". CVPR 2023
Repository for Paper: Cross-Domain Complementary Learning Using Pose for Multi-Person Part Segmentation (TCSVT20)
Common English Vocabulary Word List
Estimates the size of a PyTorch model in memory
PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722