Skip to content
View HuAndrew's full-sized avatar

Block or report HuAndrew

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Solve Visual Understanding with Reinforced VLMs

Python 3,447 203 Updated Feb 26, 2025

Fully open reproduction of DeepSeek-R1

Python 21,570 1,906 Updated Feb 26, 2025

Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"

Python 55 2 Updated Feb 22, 2025

🔥 全网首发,mmdetection Co-DETR TensorRT端到端推理加速

C++ 26 5 Updated Nov 27, 2024

Fine tuning grounding Dino

Python 86 11 Updated Dec 22, 2024

trzsz is a simple file transfer tools, similar to lrzsz ( rz / sz ), and compatible with tmux.

Python 1,263 57 Updated Jan 28, 2024

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,580 482 Updated Feb 12, 2025
Jupyter Notebook 383 40 Updated Jun 25, 2024

A collection of some awesome public object detection and recognition datasets.

78 7 Updated Jan 7, 2025

YOLO-UniOW: Efficient Universal Open-World Object Detection

Python 77 7 Updated Jan 17, 2025

The official PyTorch implementation of Google's Gemma models

Python 5,364 520 Updated Jan 6, 2025

This is the official implementation of ICLR 2024 paper "VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimodal Large Language Models".

Python 14 1 Updated Feb 24, 2025

[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving

Python 859 92 Updated Feb 19, 2025

[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving

Python 3,787 432 Updated Aug 28, 2024

Fourier Domain Adaptation for Semantic Segmentation

Python 513 82 Updated Jul 1, 2020

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 3,748 211 Updated Feb 26, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,066 569 Updated Feb 26, 2025

[BMVC'23] Prompting Visual-Language Models for Dynamic Facial Expression Recognition

Python 117 12 Updated Nov 21, 2024

YOLOX with SwinTransformer backbone.

Python 32 13 Updated Mar 4, 2022

High-resolution models for human tasks.

Python 4,844 286 Updated Nov 18, 2024

[ECCV 2024] Tokenize Anything via Prompting

Jupyter Notebook 560 24 Updated Dec 11, 2024

Sparse4D v1 & v2

Python 431 46 Updated Jun 25, 2024

MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone

Python 18,690 1,336 Updated Feb 21, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 37,052 4,263 Updated Feb 26, 2025

Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Python 291 19 Updated Feb 6, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,257 1,458 Updated Dec 25, 2024

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Python 9,552 705 Updated Feb 26, 2025

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,587 2,925 Updated Sep 2, 2024
Next