Bh-Johnny

0S Wang Bh-Johnny

Shanghaitech University/ major in CS

1 follower · 7 following

Shanghaitech University
shanghai, China
12:26 (UTC +08:00)

Stars

microsoft / RegionCLIP

[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"

Python 731 52 Updated Mar 20, 2024

apple / ml-ferret

Python 8,532 506 Updated Oct 9, 2024

mbzuai-oryx / groundingLMM

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 809 39 Updated Nov 23, 2024

AI-Application-and-Integration-Lab / SAM4MLLM

SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation, 2024

Jupyter Notebook 9 Updated Jan 2, 2025

jingyi0000 / VLM_survey

Collection of AWESOME vision-language models for vision tasks

2,678 227 Updated Dec 3, 2024

dvlab-research / LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 1,961 130 Updated Dec 30, 2024

xinyu1205 / recognize-anything

Open-source and strong foundation image recognition models.

Jupyter Notebook 3,002 281 Updated Aug 1, 2024

showlab / Awesome-GUI-Agent

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

390 24 Updated Jan 5, 2025

showlab / ShowUI

Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Jupyter Notebook 785 41 Updated Jan 4, 2025

showlab / computer_use_ootb

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,105 98 Updated Jan 1, 2025

Paitesanshi / LLM-Agent-Survey

2,674 152 Updated Dec 15, 2024

microsoft / autogen

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Jupyter Notebook 36,825 5,328 Updated Jan 5, 2025

LeslieZhoa / HeSer.Pytorch

unofficial implementation of Few-Shot Head Swapping in the Wild

Python 41 6 Updated Nov 26, 2023

M-3LAB / awesome-industrial-anomaly-detection

Paper list and datasets for industrial image anomaly/defect detection (updating). 工业异常/瑕疵检测论文及数据集检索库(持续更新)。

1,760 161 Updated Jan 3, 2025

microsoft / OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 5,421 425 Updated Jan 5, 2025

Significant-Gravitas / AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 170,114 44,750 Updated Jan 5, 2025

Charmve / computer-vision-in-action

A computer vision closed-loop learning platform where code can be run interactively online. 学习闭环《计算机视觉实战演练：算法与应用》中文电子书、源码、读者交流社区（持续更新中 ...） 📘 在线电子书 https://charmve.github.io/computer-vision-in-acti…

Jupyter Notebook 2,623 391 Updated May 27, 2024

HapKoM / coco-faces

COCO 2017 dataset labeled for face detection

Jupyter Notebook 19 5 Updated Jun 13, 2019

ByungKwanLee / Full-Segment-Anything

This is Pytorch Implementation Code for adding new features in code of Segment-Anything. Here, the features support batch-input on the full-grid prompt (automatic mask generation) with post-process…

Python 147 9 Updated Dec 7, 2023

ekremerakin / RealWorldOccludedFaces

Real World Occluded Face dataset containing 3195 neutral images, 1686 sunglasses images and 678 masked images.

Python 34 3 Updated Nov 29, 2021

jiwei0921 / SOD-CNNs-based-code-summary-

The summary of code and paper for salient object detection with deep learning.

852 148 Updated Nov 14, 2024

ChunmingHe / awesome-concealed-object-segmentation

237 6 Updated Jan 5, 2025

DCGM / ffhq-features-dataset

Forked from NVlabs/ffhq-dataset

Gender, Age, and Emotion for Flickr-Faces-HQ Dataset (FFHQ)

Shell 99 8 Updated Jan 2, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly