🚀🚀🚀A collection of some awesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applica…

571 52 Updated Jan 8, 2025

SamuelSchmidgall / AgentLaboratory

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 2,768 327 Updated Jan 13, 2025

zhengli97 / Awesome-Large-Vision-Language-Models

4 Updated Jan 14, 2025

MasterBin-IIAU / UNINEXT

[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval

Python 1,518 158 Updated Jul 18, 2023

HVision-NKU / Strip-R-CNN

Official implementation of "Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection"

12 Updated Jan 9, 2025

YXB-NKU / Strip-R-CNN

Official implementation of "Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection"

39 Updated Jan 10, 2025

deepseek-ai / DeepSeek-V3

Python 18,847 1,514 Updated Jan 7, 2025

zcablii / SM3Det

Official implementation of "SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection"

Python 51 Updated Jan 13, 2025

HVision-NKU / TAR3D

Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction'

53 Updated Dec 26, 2024

HVision-NKU / MaskCLIPpp

Official repository of the paper "MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation"

Python 16 1 Updated Jan 11, 2025

HVision-NKU / AR123

Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction'

9 Updated Dec 23, 2024

Genesis-Embodied-AI / Genesis

A generative world for general-purpose robotics & embodied AI learning.

Python 22,646 1,847 Updated Jan 12, 2025

0voice / expert_readed_books

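Latest 2021 roundup of recommended reading for engineers: computer science, software technology, entrepreneurship, ideas and thinking, mathematics, and biographies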

9,331 2,864 Updated Jun 11, 2024

HVision-NKU / DenseVLM

15 Updated Dec 11, 2024

FoundationVision / VAR

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,671 447 Updated Jan 12, 2025

yformer / EfficientTAM

Efficient Track Anything

Python 440 12 Updated Jan 6, 2025

ytzfhqs / AAAMLP-CN

Chinese translation of Approaching (Almost) Any Machine Learning Problem. Online documentation: https://ytzfhqs.github.io/AAAMLP-CN/

Jupyter Notebook 1,593 209 Updated Mar 8, 2024

rayleizhu / GLMix

[NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".

Python 31 3 Updated Nov 22, 2024

LeapLabTHU / Agent-Attention

Official repository of Agent Attention (ECCV2024)

Python 577 39 Updated Nov 17, 2024

DavidZhangdw / Visual-Tracking-Development

Visual Object Tracking

Python 461 58 Updated Dec 2, 2024

Tavarich / Awesome-Referring-Video-Object-Segmentation

A list of referring video object segmentation papers

20 Updated Jan 8, 2025

gaomingqi / Awesome-Video-Object-Segmentation

🔖 Curated list of video object segmentation (VOS) papers, datasets, and projects.

238 7 Updated Jan 11, 2025

iSEE-Laboratory / Frozen-DETR

(NeurIPS 2024) Official repository of paper "Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models"

Python 23 2 Updated Nov 27, 2024

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 41,229 4,402 Updated Jul 28, 2024

jingyi0000 / VLM_survey

Collection of AWESOME vision-language models for vision tasks

2,700 229 Updated Dec 3, 2024

Peterande / D-FINE

D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥

Python 1,353 109 Updated Jan 8, 2025

ShoufaChen / DiffusionDet

[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Python 2,131 161 Updated Dec 22, 2022

FoundationVision / GLEE

[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Python 1,131 86 Updated Oct 21, 2024