Skip to content
View taovv's full-sized avatar
👻
👻
  • SUSTech, PCL
  • ShenZhen
  • 07:09 (UTC +08:00)

Highlights

  • Pro

Block or report taovv

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’

Python 1,601 74 Updated Apr 18, 2025

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 2,167 151 Updated Feb 16, 2025

[Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Python 301 15 Updated Dec 22, 2024

[CVPR 2025] Official repository of the paper "Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation"

Python 86 1 Updated Feb 27, 2025

📚 A collection of papers about Referring Image Segmentation.

711 56 Updated Apr 14, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 15,111 1,670 Updated Dec 25, 2024

Adapters Strike Back (CVPR 2024)

Python 35 1 Updated Jul 24, 2024

[CVPR2024] Generative Region-Language Pretraining for Open-Ended Object Detection

Python 167 7 Updated Mar 29, 2025

CLIP Surgery for Better Explainability with Enhancement in Open-Vocabulary Tasks

Jupyter Notebook 415 27 Updated Mar 1, 2025

[CVPR2024] Open-Vocabulary Semantic Segmentation with Image Embedding Balancing

Python 31 Updated Jun 17, 2024

[CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection

Python 92 8 Updated Jul 24, 2024

Mamba SSM architecture

Python 14,672 1,279 Updated Apr 1, 2025

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,377 231 Updated Feb 13, 2025

VMamba: Visual State Space Models,code is based on mamba

Python 2,555 175 Updated Mar 7, 2025

Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries".

Python 54 2 Updated Apr 29, 2024

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Python 957 32 Updated Jul 31, 2024

[WACV 2025] Python implementation of Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation

Python 226 20 Updated Jan 24, 2025

Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"

Python 298 29 Updated Apr 11, 2024

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,333 511 Updated Feb 26, 2025

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Python 31,810 7,643 Updated Apr 13, 2025

(TPAMI 2024) A Survey on Open Vocabulary Learning

924 50 Updated Mar 23, 2025

Open-vocabulary Semantic Segmentation

Python 342 33 Updated Oct 16, 2024

[NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP

Python 313 28 Updated Feb 5, 2024

A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..

605 29 Updated Apr 8, 2025

Grounded Language-Image Pre-training

Python 2,385 206 Updated Jan 24, 2024

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,565 424 Updated Aug 19, 2024

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,690 491 Updated May 31, 2024

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…

Python 68,263 8,334 Updated Apr 20, 2025

This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).

940 62 Updated Apr 23, 2025

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Jupyter Notebook 2,321 152 Updated Dec 24, 2024
Next