-
Hangzhou Dianzi University
- HangZhou, Zhejiang
-
02:36
(UTC -12:00) - https://www.hdu.edu.cn/main.htm
Highlights
- Pro
Stars
[MICCAI 2024] TeethDreamer: 3D Teeth Reconstruction from Five Intra-oral Photographs
Official code for the MICCAI 2024 paper "3D Vessel Graph Generation Using Denoising Diffusion"
The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
A Platform for Visual Learning from Human Feedback
A collection of AWESOME things about mixture-of-experts
[CIBM 2024] Segment Anything Model for Medical Image Segmentation: Open-Source Project Summary
Efficient Segment Anything in Medical Images
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
A pytorch-based deep learning framework for multi-modal 2D/3D medical image segmentation
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
VMamba: Visual State Space Models,code is based on mamba
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.
[AAAI' 25] U-KAN Makes Strong Backbone for Medical Image Segmentation and Generation
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
[CVPR'24] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
Segment Anything in Medical Images
Comparing performance of a small transformer model with and without Knowledge Distillation
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundation visual backbones.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
リアルタイムボイスチェンジャー Realtime Voice Changer
Blendshape and kinematics calculator for Mediapipe/Tensorflow.js Face, Eyes, Pose, and Finger tracking models.