-
www.ilovepose.com
- China
- www.ilovepose.cn
Stars
Solve Visual Understanding with Reinforced VLMs
Fully open reproduction of DeepSeek-R1
Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of Large Language Models"
🔥 全网首发,mmdetection Co-DETR TensorRT端到端推理加速
Fine tuning grounding Dino
trzsz is a simple file transfer tools, similar to lrzsz ( rz / sz ), and compatible with tmux.
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
A collection of some awesome public object detection and recognition datasets.
YOLO-UniOW: Efficient Universal Open-World Object Detection
The official PyTorch implementation of Google's Gemma models
This is the official implementation of ICLR 2024 paper "VDC: Versatile Data Cleanser based on Visual-Linguistic Inconsistency by Multimodal Large Language Models".
[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
Fourier Domain Adaptation for Semantic Segmentation
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
[BMVC'23] Prompting Visual-Language Models for Dynamic Facial Expression Recognition
YOLOX with SwinTransformer backbone.
High-resolution models for human tasks.
[ECCV 2024] Tokenize Anything via Prompting
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)