- Macau University
- Taipa University Road, Macau, China
Highlights
- Pro
Stars
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Generative Models by Stability AI
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
A collection of loss functions for medical image segmentation
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
Official PyTorch implementation of SegFormer
VMamba: Visual State Space Models, code is based on mamba
Semi Supervised Learning for Medical Image Segmentation, a collection of literature reviews and code implementations.
Papers about pretraining and self-supervised learning on Graph Neural Networks (GNN).
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch
[ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts".
❄️🔥 Visual Prompt Tuning [ECCV 2022] https://arxiv.org/abs/2203.12119
Corruption and Perturbation Robustness (ICLR 2019)
A contrastive learning based semi-supervised segmentation network for medical image segmentation
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Implementation of some baseline models built on BERT for text classification
Code for "Detector-Free Structure from Motion", CVPR 2024
[CVPR 2024 Oral] Official repository of FMA-Net