Official repo for our ICML 23 paper: "Multi-Modal Classifiers for Open-Vocabulary Object Detection"

prannaykaul/mm-ovod

Multi-Modal Classifiers for Open-Vocabulary Object Detection

Multi-Modal Classifiers for Open-Vocabulary Object Detection,
Prannay Kaul, Weidi Xie, Andrew Zisserman
ICML 2023 (arXiv 2306.05493)

Updates

  • June 2023: Code and checkpoints for the LVIS models in the main paper are released. Training code for the visual aggregator will follow soon.

Installation

See installation instructions.

Benchmark evaluation and training

First prepare the datasets, then see our MODEL ZOO to reproduce the results in our paper.

License

See the Detic license. Our code is based on the Detic repository.

Citation

If you find this project useful for your research, please use the following BibTeX entry.

@inproceedings{Kaul2023,
  title={Multi-Modal Classifiers for Open-Vocabulary Object Detection},
  author={Kaul, Prannay and Xie, Weidi and Zisserman, Andrew},
  booktitle={ICML},
  year={2023}
}
