Starred repositories
collection of diffusion model papers categorized by their subareas
A minimal implementation of a denoising diffusion model in PyTorch.
SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
[ICLR 2024 Oral] Supervised Pre-Trained 3D Models for Medical Image Analysis (9,262 CT volumes + 25 annotated classes)
[CVPR 2025] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
High-performance Image Tokenizers for VAR and AR
MICCAI 2023: DHC: Dual-debiased Heterogeneous Co-training Framework for Class-imbalanced Semi-supervised Medical Image Segmentation
Decoupled Consistency for Semi-supervised Medical Image Segmentation(MICCAI 2023)
[CVPR 2023] Revisiting Weak-to-Strong Consistency in Semi-Supervised Semantic Segmentation
BCI: Breast Cancer Immunohistochemical Image Generation through Pyramid Pix2pix
[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"
Large World Model -- Modeling Text and Video with Millions Context
PWM: Policy Learning with Large World Models
[ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving
Collect some World Models for Autonomous Driving (and Robotic) papers.
Official repository for "DiffAssemble: A Unified Graph-Diffusion Model for 2D and 3D Reassembly" accepted at CVPR2024
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
NeurIPS'24 DB (Spotlight) | Instruction Tuning Large Language Models to Understand Electronic Health Records
PyTorch implementation for "WDM: 3D Wavelet Diffusion Models for High-Resolution Medical Image Synthesis" (DGM4MICCAI 2024)
"Beyond the Snapshot: Brain Tokenized Graph Transformer for Longitudinal Brain Functional Connectome Embedding" (MICCAI 2023)
A suite of image and video neural tokenizers
[IEEE TMI 2024] Pseudo-Bag Mixup Augmentation for Multiple Instance Learning-Based Whole Slide Image Classification