Stars
⛹️ Pytorch ReID: A tiny, friendly, strong pytorch implement of person re-id / vehicle re-id baseline. Tutorial 👉https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial
✨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).
This project aims to collect the latest "call for reviewers" links from various top CS/ML/AI conferences/journals
[ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation
Official code for our paper "Harnessing Uncertainty-aware Bounding Boxes for Unsupervised 3D Object Detection".
[ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting
This is the official implementation of "Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data" (Accepted at ECCV 2024).
[ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models
Implementation for "DeltaPhi: Learning Physical Trajectory Residual for PDE Solving"
Official implementation for SlimmeRF: Slimmable Radiance Fields (3DV 2024 Best Paper)
[ICCV,2023]Sample-adaptive Augmentation for Point Cloud Recognition Against Real-world Corruptions
An offical repo for ECCV 2024 Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatial Relation Matching
Offical repo for ECCV 2024: Depth-Aware Blind Image Decomposition for Real-World Weather Recovery
Official repo for our ECCV'24 paper: Approaching Outside: Scaling Unsupervised 3D Object Detection from 2D Scene.
[TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation
This is the official implementation of "Clustering Propagation for Universal Medical Image Segmentation" (Accepted at CVPR 2024).
[CVPR24] Volumetric Environment Representation for Vision-Language Navigation
[CVPR'24] Neural Clustering based Visual Representation Learning
chen742 / FEC
Forked from guikunchen/FEC[CVPR'24] Neural Clustering based Visual Representation Learning
Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models
[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.
Repository for the paper : ME-D2N: Multi-Expert Domain Decompositional Network for Cross-Domain Few-Shot Learning
Official code for ICCV 2023 paper: "TransHuman: A Transformer-based Human Representation for Generalizable Neural Human Rendering".
The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"
Next-generation Video instance recognition framework on top of Detectron2 which supports InstMove (CVPR 2023), SeqFormer(ECCV Oral), and IDOL(ECCV Oral))