CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data

CatLIP introduces a novel weakly supervised pre-training approach for vision models on web-scale, noisy image-text data. It reframes pre-training as a classification task, which avoids the pairwise similarity computations that make contrastive learning computationally expensive. The result is 2.7x faster training while maintaining high representation quality across a range of vision tasks.
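To make the contrast concrete, here is a minimal NumPy sketch of the two loss formulations. It is an illustration under stated assumptions, not the code shipped in this repository: the function names, the temperature value, and the use of a multi-hot target vector over a label vocabulary with binary cross-entropy are all assumptions made for the example. The point is only that a CLIP-style contrastive loss needs a batch-by-batch similarity matrix, while a classification loss scores each image independently.

```python
import numpy as np


def contrastive_loss(img_emb, txt_emb, temperature=0.07):
    """CLIP-style symmetric InfoNCE loss (illustrative).

    Builds a (B, B) similarity matrix: every image is compared
    against every caption in the batch.
    """
    img = img_emb / np.linalg.norm(img_emb, axis=1, keepdims=True)
    txt = txt_emb / np.linalg.norm(txt_emb, axis=1, keepdims=True)
    logits = img @ txt.T / temperature  # shape (B, B)
    idx = np.arange(logits.shape[0])

    def cross_entropy(scores):
        # Row-wise log-softmax, then pick the matching (diagonal) pair.
        scores = scores - scores.max(axis=1, keepdims=True)
        log_probs = scores - np.log(np.exp(scores).sum(axis=1, keepdims=True))
        return -log_probs[idx, idx].mean()

    # Average the image-to-text and text-to-image directions.
    return 0.5 * (cross_entropy(logits) + cross_entropy(logits.T))


def classification_loss(logits, targets):
    """Multi-label binary cross-entropy on logits (illustrative).

    logits and targets have shape (B, V), where V is the size of a
    hypothetical label vocabulary. No term couples one sample to another.
    """
    # Numerically stable form: max(x, 0) - x * y + log(1 + exp(-|x|))
    loss = (
        np.maximum(logits, 0)
        - logits * targets
        + np.log1p(np.exp(-np.abs(logits)))
    )
    return loss.mean()


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    batch, dim, vocab = 4, 8, 16  # toy sizes chosen for the example

    img_emb = rng.normal(size=(batch, dim))
    txt_emb = rng.normal(size=(batch, dim))
    print("contrastive:", contrastive_loss(img_emb, txt_emb))

    cls_logits = rng.normal(size=(batch, vocab))
    multi_hot = (rng.random((batch, vocab)) < 0.2).astype(float)
    print("classification:", classification_loss(cls_logits, multi_hot))
```

In this sketch the contrastive path also requires a text encoder to produce `txt_emb`, whereas the classification path only needs image-side logits and precomputed targets.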

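We provide training and evaluation code, along with pretrained models and configuration files, for the following tasks:

- Pre-training (README-pretraining.md)
- Single-label object classification (README-single-label-object-classification.md)
- Multi-label object classification (README-multi-label-object-classification.md)
- Object detection (README-object-detection.md)
- Semantic segmentation (README-semantic-segmentation.md)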

Citation

If you find our work useful, please cite:

@article{mehta2024catlip,
  title={CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data}, 
  author={Sachin Mehta and Maxwell Horton and Fartash Faghri and Mohammad Hossein Sekhavat and Mahyar Najibi and Mehrdad Farajtabar and Oncel Tuzel and Mohammad Rastegari},
  year={2024},
  eprint={2404.15653},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

@inproceedings{mehta2022cvnets, 
     author = {Mehta, Sachin and Abdolhosseini, Farzad and Rastegari, Mohammad}, 
     title = {CVNets: High Performance Library for Computer Vision}, 
     year = {2022}, 
     booktitle = {Proceedings of the 30th ACM International Conference on Multimedia}, 
     series = {MM '22} 
}
