-
HKUST(GZ)
- CHine
-
06:47
(UTC -12:00) - https://scholar.google.com/citations?user=EVpo9eQAAAAJ&hl=zh-CN&oi=ao
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Code for CVPR'24 Paper: Segment Any Event Streams via Weighted Adaptation of Pivotal Tokens
Open-Sora: Democratizing Efficient Video Production for All
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Learning Multiple Dense Prediction Tasks from Partially Annotated Data - CVPR 2022
ImageBind One Embedding Space to Bind Them All
[NeurIPS 2024] SlimSAM: 0.1% Data Makes Segment Anything Slim
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
Tracking and collecting papers/projects/others related to Segment Anything.
Collection of AWESOME vision-language models for vision tasks
A PyTorch Library for Multi-Task Learning
CrossKD: Cross-Head Knowledge Distillation for Dense Object Detection
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (p…
Taming Transformers for High-Resolution Image Synthesis
Papers, codes, datasets, researchers on information bottleneck
Pytorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)
Paper List for Multi-Task Learning (focus on architectures and optimization for MTL)
[ICCV 2021] Code for our paper Domain Adaptive Semantic Segmentation with Self-Supervised Depth Estimation
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".