Open-source and strong foundation image recognition models.

Jupyter Notebook 3,060 286 Updated Aug 1, 2024

Witness the aha moment of VLM with less than $3.

Python 1,967 146 Updated Feb 10, 2025

Material for gpu-mode lectures

Jupyter Notebook 3,667 370 Updated Feb 9, 2025

Awesome Data-Driven Autonomous Driving Solutions. Also the official repository of our survey paper: Data-Centric Evolution in Autonomous Driving: A Comprehensive Survey of Big Data System, Data Min…

160 6 Updated Mar 20, 2024

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 3,341 1,415 Updated Feb 9, 2025

A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for various file types through textract.

Python 998 59 Updated Oct 4, 2024

Research Software Engineering with Python course material

TeX 250 63 Updated Dec 16, 2024

Software Design by Example: a tool-based introduction with Python

Python 408 61 Updated Dec 1, 2024

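AIGC-interview/CV-interview/LLMs-interview: a collection of interview questions and answers, which also includes new ideas, new questions, new resources, and new projects from work and research.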

1,987 188 Updated Jan 13, 2025

Official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning"

26 Updated Feb 10, 2025

Document Artificial Intelligence

144 6 Updated Dec 8, 2024

Repository for fine-tuning open-source VLMs

Python 2 Updated Dec 27, 2024

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 11,502 1,148 Updated Feb 3, 2025

This is a Phi Family of SLMs book for getting started with Phi models. Phi is a family of open-source AI models developed by Microsoft. Phi models are the most capable and cost-effective small langua…

Jupyter Notebook 2,717 325 Updated Jan 27, 2025

An official repo for WACV 2025 paper "LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relations"

Python 8 Updated Jan 27, 2025
Python 26 Updated Dec 12, 2024
Python 47 3 Updated Dec 18, 2024

This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-e…

Jupyter Notebook 6,861 1,086 Updated Feb 4, 2025

(TPAMI 2024) A Survey on Open Vocabulary Learning

881 50 Updated Dec 10, 2024

EMOv2: Pushing 5M Vision Model Frontier

Python 42 1 Updated Dec 30, 2024

[NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"

Python 111 9 Updated Dec 14, 2024

Compose multimodal datasets 🎹

Python 276 12 Updated Feb 3, 2025

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,832 226 Updated Feb 10, 2025
Python 38 5 Updated Nov 7, 2024

[ICCV'2023] Compositional Feature Augmentation for Unbiased Scene Graph Generation

Jupyter Notebook 14 2 Updated Dec 5, 2023

ROOT: VLM based System for Indoor Scene Understanding and Beyond

Jupyter Notebook 22 Updated Jan 22, 2025

This is the code of ECCV 2022 (Oral) paper "Fine-Grained Scene Graph Generation with Data Transfer".

Jupyter Notebook 99 7 Updated Jan 24, 2023

[WACV 2025] This is the official implementation of the paper "Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge" in PyTorch.

Python 25 2 Updated Oct 29, 2024

[CVPR 2024] Code for HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation

Python 67 2 Updated Oct 11, 2024