Skip to content
View mgh233's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report mgh233

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 7 Updated Jan 8, 2025

AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目

1,915 186 Updated Jan 13, 2025

🚀🚀🚀A collection of some wesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applica…

571 52 Updated Jan 8, 2025

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 2,768 327 Updated Jan 13, 2025

[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval

Python 1,518 158 Updated Jul 18, 2023

Offical implementation of "Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection"

12 Updated Jan 9, 2025

Offical implementation of "Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection"

39 Updated Jan 10, 2025

Offical implementation of "SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection"

Python 51 Updated Jan 13, 2025

Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction'

53 Updated Dec 26, 2024

Official repository of the paper "MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation"

Python 16 1 Updated Jan 11, 2025

Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction'

9 Updated Dec 23, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 22,646 1,847 Updated Jan 12, 2025

2021年最新总结,推荐工程师合适读本,计算机科学,软件技术,创业,思想类,数学类,人物传记书籍

9,331 2,864 Updated Jun 11, 2024
15 Updated Dec 11, 2024

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,671 447 Updated Jan 12, 2025

Efficient Track Anything

Python 440 12 Updated Jan 6, 2025

Approaching (Almost) Any Machine Learning Problem中译版,在线文档地址:https://ytzfhqs.github.io/AAAMLP-CN/

Jupyter Notebook 1,593 209 Updated Mar 8, 2024

[NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".

Python 31 3 Updated Nov 22, 2024

Official repository of Agent Attention (ECCV2024)

Python 577 39 Updated Nov 17, 2024

Visual Object Tracking

Python 461 58 Updated Dec 2, 2024

A list of referring video object segmentation papers

20 Updated Jan 8, 2025

🔖 Curated list of video object segmentation (VOS) papers, datasets, and projects.

238 7 Updated Jan 11, 2025

(NeurIPS 2024) Official repository of paper "Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models"

Python 23 2 Updated Nov 27, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 41,229 4,402 Updated Jul 28, 2024

Collection of AWESOME vision-language models for vision tasks

2,700 229 Updated Dec 3, 2024

D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥

Python 1,353 109 Updated Jan 8, 2025

[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Python 2,131 161 Updated Dec 22, 2022

[CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,131 86 Updated Oct 21, 2024
Next