- Tsinghua University
- Shenzhen, Guangdong
Stars
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.
A collection of resources on applications of multi-modal learning in medical imaging.
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
[NeurIPS'22] Multi-Granularity Cross-modal Alignment for Generalized Medical Visual Representation Learning
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focuse…
Code for "Cascaded Latent Diffusion Models for High-Resolution Chest X-ray Synthesis" @ PAKDD 2023
Benchmark for Natural Temporal Distribution Shift (NeurIPS 2022)
Code for the paper "RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning" (EMNLP'23 Findings).
Implementation of ViViT: A Video Vision Transformer
Repository to train ControlNet on Brain data (UK BIOBANK) using MONAI Generative Models
[MICCAI-2023] ArSDM: Colonoscopy Images Synthesis with Adaptive Refinement Semantic Diffusion Models
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
Code for the paper "Addressing Model Vulnerability to Distributional Shifts over Image Transformation Sets", ICCV 2019
PyTorch implementation of the paper "Generalizing to Unseen Domains via Adversarial Data Augmentation", NeurIPS 2018. Original TensorFlow implementation: https://github.com/ricvolpi/generalize…
Code for the paper "Generalizing to Unseen Domains via Adversarial Data Augmentation", NeurIPS 2018
Character Animation (AnimateAnyone, Face Reenactment)
Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"
A robust classifier for the few-training-data problem, based on a distributionally robust optimization framework
Learning to Prompt (L2P) for Continual Learning @ CVPR22 and DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning @ ECCV22
[NeurIPS'23 Spotlight] Official Repo for "Extraction and recovery of spatio-temporal structure in latent dynamics alignment with diffusion models"
Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.
A collection of AWESOME things about domain adaptation