Skip to content
View Bh-Johnny's full-sized avatar
  • Shanghaitech University
  • shanghai, China
  • 12:26 (UTC +08:00)

Block or report Bh-Johnny

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"

Python 731 52 Updated Mar 20, 2024
Python 8,532 506 Updated Oct 9, 2024

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 809 39 Updated Nov 23, 2024

SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation, 2024

Jupyter Notebook 9 Updated Jan 2, 2025

Collection of AWESOME vision-language models for vision tasks

2,678 227 Updated Dec 3, 2024

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 1,961 130 Updated Dec 30, 2024

Open-source and strong foundation image recognition models.

Jupyter Notebook 3,002 281 Updated Aug 1, 2024

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

390 24 Updated Jan 5, 2025

Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Jupyter Notebook 785 41 Updated Jan 4, 2025

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,105 98 Updated Jan 1, 2025

A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

Jupyter Notebook 36,825 5,328 Updated Jan 5, 2025

unofficial implementation of Few-Shot Head Swapping in the Wild

Python 41 6 Updated Nov 26, 2023

Paper list and datasets for industrial image anomaly/defect detection (updating). 工业异常/瑕疵检测论文及数据集检索库(持续更新)。

1,760 161 Updated Jan 3, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 5,421 425 Updated Jan 5, 2025

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 170,114 44,750 Updated Jan 5, 2025

A computer vision closed-loop learning platform where code can be run interactively online. 学习闭环《计算机视觉实战演练:算法与应用》中文电子书、源码、读者交流社区(持续更新中 ...) 📘 在线电子书 https://charmve.github.io/computer-vision-in-acti…

Jupyter Notebook 2,623 391 Updated May 27, 2024

COCO 2017 dataset labeled for face detection

Jupyter Notebook 19 5 Updated Jun 13, 2019

This is Pytorch Implementation Code for adding new features in code of Segment-Anything. Here, the features support batch-input on the full-grid prompt (automatic mask generation) with post-process…

Python 147 9 Updated Dec 7, 2023

Real World Occluded Face dataset containing 3195 neutral images, 1686 sunglasses images and 678 masked images.

Python 34 3 Updated Nov 29, 2021

The summary of code and paper for salient object detection with deep learning.

852 148 Updated Nov 14, 2024

Gender, Age, and Emotion for Flickr-Faces-HQ Dataset (FFHQ)

Shell 99 8 Updated Jan 2, 2020

🏂🏻 程序员海外工作/英文面试手册

4,537 326 Updated Feb 25, 2024

High-resolution models for human tasks.

Python 4,713 269 Updated Nov 18, 2024

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Python 975 31 Updated Jul 31, 2024

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 9,741 1,480 Updated Oct 21, 2024

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 4,589 516 Updated Jan 3, 2025

Customized ID Consistent for human

Python 915 81 Updated Jan 4, 2025
Python 106 10 Updated Oct 23, 2022
Next