Stars
[ECCVW 2022] The codes for the work "Swin-Unet: Unet-like Pure Transformer for Medical Image Segmentation"
The implementation of our ACM MM 2023 paper "AdvCLIP: Downstream-agnostic Adversarial Examples in Multimodal Contrastive Learning"
deep learning for image processing including classification and object-detection etc.
OpenMMLab Pre-training Toolbox and Benchmark
多模态情感分析——基于BERT+ResNet的多种融合方法
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Multimodal deep learning model for long-term cancer survival prediction
[ACL'19] [PyTorch] Multimodal Transformer
Official Pytorch Code for "Medical Transformer: Gated Axial-Attention for Medical Image Segmentation" - MICCAI 2021
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
[NeurIPS 2021] You Only Look at One Sequence
End-to-End Object Detection with Transformers
[IEEE TIP 2022] Official implementation of MATR: Multimodal Medical Image Fusion via Multiscale Adaptive Transformer
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
CSAW-M: An Ordinal Classification Dataset for Benchmarking Mammographic Masking of Cancer