Stars
AIGC-interview/CV-interview/LLMs-interview: a repository collecting interview questions and answers, along with new ideas, new problems, new resources, and new projects from work and research
🚀🚀🚀A collection of some awesome public projects about Large Language Model (LLM), Vision Language Model (VLM), Vision Language Action (VLA), AI Generated Content (AIGC), the related Datasets and Applica…
Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas
[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval
Official implementation of "Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection"
Official implementation of "SM3Det: A Unified Model for Multi-Modal Remote Sensing Object Detection"
Official Code for 'TAR3D: Creating High-Quality 3D Assets via Next-Part Prediction'
Official repository of the paper "MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation"
Official Code for 'AR-1-to-3: Single Image to Consistent 3D Object Generation via Next-View Prediction'
A generative world for general-purpose robotics & embodied AI learning.
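Latest 2021 roundup of recommended reading for engineers: computer science, software technology, entrepreneurship, ideas, mathematics, and biographies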
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
Chinese translation of Approaching (Almost) Any Machine Learning Problem. Online documentation: https://ytzfhqs.github.io/AAAMLP-CN/
[NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".
Official repository of Agent Attention (ECCV2024)
A list of referring video object segmentation papers
🔖 Curated list of video object segmentation (VOS) papers, datasets, and projects.
(NeurIPS 2024) Official repository of paper "Frozen-DETR: Enhancing DETR with Image Understanding from Frozen Foundation Models"
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Collection of AWESOME vision-language models for vision tasks
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement 💥💥💥
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale