Stars
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
A General-purpose Person Re-identification Task with Instructions
A new framework for open-vocabulary object detection, based on maskrcnn-benchmark
(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
C++ library based on tensorrt integration
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
[CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
Taming Self-Training for Open-Vocabulary Object Detection, CVPR 2024
LP-OVOD: Open-Vocabulary Object Detection by Linear Probing (WACV 2024)
Code release for "Weakly Supervised Open-Vocabulary Object Detection", AAAI2024
Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection
state-of-the-art open vocabulary detector on COCO/LVIS/V3Det
InstaGen: Enhancing Object Detection by Training on Synthetic Dataset, CVPR2024
Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"
This repository is an official implementation of the paper "LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection".
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
[ECCV 2024] Official implementation of "LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction"
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgent is a novel automatic prompt optimization method that auton…
A framework for prompt tuning using Intent-based Prompt Calibration
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
Official Code for DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing (CVPR 2024)