Starred repositories
This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation"
“FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching”. FlowAR employs the simplest scale design and is compatible with any VAE.
Implementation of UltraPixel: Advancing Ultra-High-Resolution Image Synthesis to New Peaks
This repository is the official implementation of our paper "Autoregressive Pretraining with Mamba in Vision"
Implementation of Diffusion Transformer (DiT) in JAX
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
[ICLR 2024 Oral] Supervised Pre-Trained 3D Models for Medical Image Analysis (9,262 CT volumes + 25 annotated classes)
Code base for behavior cloning algorithms, built with simplicity in mind.
[NeurIPS 2023] AbdomenAtlas 1.0 (5,195 CT volumes + 9 annotated classes)
[ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Learners"
[CVPR 2024] The official implementation of the paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"
This repository is a collection of research papers on World Models.
VideoLLM: Modeling Video Sequence with Large Language Models
An interactive demo based on Segment-Anything for style transfer that lets different content regions receive different styles.
A PyTorch implementation of the paper 'Exploring Simple Siamese Representation Learning'