Multi-Modal Classifiers for Open-Vocabulary Object Detection

Multi-Modal Classifiers for Open-Vocabulary Object Detection,
Prannay Kaul, Weidi Xie, Andrew Zisserman
ICML 2023 (arXiv 2306.05493)

Updates

June 2023: Code and checkpoints for the LVIS models in the main paper are released. Training code for the visual aggregator will follow soon.

Installation

See installation instructions.

Benchmark evaluation and training

Please prepare the datasets first, then check our MODEL ZOO to reproduce the results in our paper.

License

Our code is based on the Detic repository; see Detic for the license terms.

Citation

If you find this project useful for your research, please use the following BibTeX entry.

@inproceedings{Kaul2023,
  title={Multi-Modal Classifiers for Open-Vocabulary Object Detection},
  author={Kaul, Prannay and Xie, Weidi and Zisserman, Andrew},
  booktitle={ICML},
  year={2023}
}
