Stars
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
CLIP (Contrastive Language-Image Pretraining): predicts the most relevant text snippet given an image
Official code for "Uniform Masking: Enabling MAE Pre-training for Pyramid-based Vision Transformers with Locality"
[CVPR'W19-Oral] Official repository for "iSAID: A Large-scale Dataset for Instance Segmentation in Aerial Images"
Using DUCK-Net for polyp image segmentation (Nature Scientific Reports 2023).
Dataset API for "PhraseCut: Language-based Image Segmentation in the Wild"
For holding anime-related object classification and detection models
Semantic segmentation of remote sensing images using DeepLabv3 (PyTorch)
This is the official repository of "VLAttack: Multimodal Adversarial Attacks on Vision-Language Tasks via Pre-trained Models" (NeurIPS 2023).
Gland segmentation on the GlaS 2015 dataset using a UNet model.
Official implementation of "FreeSeed: Frequency-band-aware and Self-guided Network for Sparse-view CT Reconstruction"