SUSTech, PCL
- ShenZhen
Stars
Official repository of "Visual-RFT: Visual Reinforcement Fine-Tuning"
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
[NeurIPS'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning
[CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"
📚 A collection of papers about Referring Image Segmentation.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection
CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks
[CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing
[CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
VMamba: Visual State Space Models; code is based on Mamba
Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries".
[ECCV 2024] The official code of paper "Open-Vocabulary SAM".
[WACV 2025] Python implementation of Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation
Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
(TPAMI 2024) A Survey on Open Vocabulary Learning
[NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP
A curated list of publications and resources on open vocabulary semantic segmentation and related areas (e.g. zero-shot semantic segmentation).
[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
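Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other languages; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…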
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything