Stars
A list of awesome papers and resources of recommender system on large language model (LLM).
An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch
Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
we want to create a repo to illustrate usage of transformers in chinese
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
When do we not need larger vision models?
[CVPR2024 Highlight] Adapting Visual-Language Models for Generalizable Anomaly Detection in Medical Images
Experiment on combining CLIP with SAM to do open-vocabulary image segmentation.
[ECCV 2024] The official code of paper "Open-Vocabulary SAM".
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
Pathology Language and Image Pre-Training (PLIP) is the first vision and language foundation model for Pathology AI (Nature Medicine). PLIP is a large-scale pre-trained model that can be used to ex…
A collection of resources on applications of multi-modal learning in medical imaging.
Segment Anything in Medical Images
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
An official implementation of "Incomplete Multimodality-Diffused Emotion Recognition" in PyTorch. (NeurIPS 2023)
Uniformaly: Towards Task-Agnostic Unified Anomaly Detection
Official Implement of "ADGym: Design Choices for Deep Anomaly Detection", NeurIPS 2023
Anomaly detection with diffusion models
My attempt at reproducing the paper Deep Autoencoding Gaussian Mixture Model for Unsupervised Anomaly Detection
HVTSurv: Hierarchical Vision Transformer for Patient-level Survival Prediction from Whole Slide Image-AAAI 2023
A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques