An intelligent assistant serving the entire software development lifecycle, powered by a Multi-Agent Framework, working with DevOps Toolkits, Code&Doc Repo RAG, etc.

Python 1,129 116 Updated Jul 1, 2024

FoundationVision / GLEE

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,096 86 Updated Oct 21, 2024

magic-research / Sa2VA

🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Python 932 59 Updated Feb 25, 2025

mbzuai-oryx / groundingLMM

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 834 42 Updated Nov 23, 2024

WangLibo1995 / GeoSeg

UNetFormer: A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery, ISPRS. Also, including other vision transformers and CNNs for satellite, aerial image …

Python 833 124 Updated Aug 19, 2024

mbzuai-oryx / LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python 832 61 Updated Jul 10, 2024

if-ai / ComfyUI-IF_AI_tools

ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generat…

Python 607 47 Updated Jan 3, 2025

shenyunhang / APE

[CVPR 2024] Aligning and Prompting Everything All at Once for Universal Visual Perception

Python 550 41 Updated May 8, 2024

xinghaochen / TinySAM

[AAAI 2025] Official PyTorch implementation of "TinySAM: Pushing the Envelope for Efficient Segment Anything Model"

Python 445 27 Updated Jan 19, 2025

opengeos / WhiteboxTools-ArcGIS

ArcGIS Python Toolbox for WhiteboxTools

Python 275 66 Updated Nov 11, 2024

walking-shadow / Official_Remote_Sensing_Mamba

Official code of Remote Sensing Mamba

Python 269 14 Updated Apr 25, 2024

Beckschen / ViTamin

[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"

Python 197 7 Updated Jun 9, 2024

likyoo / SegEarth-OV

[CVPR 2025] SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images

Python 74 1 Updated Dec 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

skylning

Block or report skylning

Stars

modelscope / agentscope

princeton-vl / infinigen

hephaest0s / usbkill

om-ai-lab / VLM-R1

dvlab-research / MGM

lyuwenyu / RT-DETR

IDEA-Research / T-Rex

qianqianwang68 / omnimotion

cambrian-mllm / cambrian

ZrrSkywalker / Personalize-SAM

siyuanliii / masa

codefuse-ai / codefuse-chatbot