A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull request…

234 18 Updated Dec 12, 2024

Atten4Vis / LW-DETR

This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection".

Python 271 19 Updated Jul 25, 2024

ChenDarYen / ArtFusion

ArtFusion: Controllable Arbitrary Style Transfer using Dual Conditional Latent Diffusion Models

Jupyter Notebook 69 4 Updated Jul 26, 2023

cyclomon / UNSB

Official Repository of "Unpaired Image-to-Image Translation via Neural Schrödinger Bridge" (ICLR 2024)

Python 178 11 Updated Apr 30, 2024

microsoft / generative-ai-for-beginners

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 66,202 34,201 Updated Dec 12, 2024

microsoft / GLIP

Grounded Language-Image Pre-training

Python 2,293 197 Updated Jan 24, 2024

clin1223 / VLDet

[ICLR 2023] PyTorch implementation of VLDet （https://arxiv.org/abs/2211.14843）

Python 184 11 Updated Mar 22, 2024

haotian-liu / LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,014 2,312 Updated Aug 12, 2024

wusize / F-LMM

Code Release of F-LMM: Grounding Frozen Large Multimodal Models

Python 59 1 Updated Aug 5, 2024

Zehong-Ma / OVMR

OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)

Python 23 1 Updated Nov 20, 2024

clovaai / ProxyDet

Official implementation of the paper "ProxyDet: Synthesizing Proxy Novel Classes via Classwise Mixup for Open-Vocabulary Object Detection"

Python 23 2 Updated Feb 13, 2024

yuhangzang / ContextDET

Contextual Object Detection with Multimodal Large Language Models

Python 210 5 Updated Oct 14, 2024

UX-Decoder / LLaVA-Grounding

Python 367 14 Updated Jul 29, 2024

sinahmr / NACLIP

PyTorch Implementation of NACLIP in "Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation"

Python 44 5 Updated Sep 23, 2024

fcjian / InstaGen

InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024

Jupyter Notebook 74 3 Updated Apr 9, 2024

witnessai / Awesome-Open-Vocabulary-Object-Detection

A curated list of papers, datasets and resources pertaining to open vocabulary object detection.

294 18 Updated Jun 25, 2024