Stars
[ArXiv2024] ModPrompt: Visual Modality Prompt for Adapting Vision-Language Object Detectors
Awesome List of Vision Language Prompt Papers
Test-time Prompt Tuning (TPT) for zero-shot generalization in vision-language models (NeurIPS 2022))
Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Collection of AWESOME vision-language models for vision tasks
Collection of awesome test-time (domain/batch/instance) adaptation methods
This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
[WACV2025] MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection
[ECCV2024] ModTr: Modality Translation for Object Detection Adaptation Without Forgetting Prior Knowledge
Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
Codes to complement YouTube videos and blog posts on Medium.
A latent text-to-image diffusion model
A curated list of foundation models for vision and language tasks
[WACV2024] HalluciDet: Hallucinating RGB Modality for Person Detection Through Privileged Information (Accepted at WACV 2024 and LatinX@CVPR2024 Extended Abstract)
Awesome-LLM: a curated list of Large Language Model
Official implementation of the CVPR 2023 paper "Harmonious Teacher for Cross-domain Object Detection"
A Collection of Domain Adaptation for Object Detection Material
converting bdd100k json to pascal voc style xml files
Convert between visual object detection datasets
[AAAI' 22 ORAL] SCAN: Cross Domain Object Detection with Semantic Conditioned Adaptation
[CVPR' 22 ORAL] SIGMA: Semantic-complete Graph Matching for Domain Adaptative Object Detection