LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.

Python 398 16 Updated Jan 13, 2025

Deep-Agent / R1-V

Witness the aha moment of VLM with less than $3.

Python 3,095 242 Updated Mar 1, 2025

dair-ai / ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

14,353 1,431 Updated Feb 13, 2023

PKU-ICST-MIPL / Finedefics_ICLR2025

Python 34 2 Updated Feb 24, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 22,408 2,010 Updated Mar 9, 2025

deepseek-ai / DeepSeek-R1

85,570 11,046 Updated Feb 24, 2025

deepseek-ai / DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,442 1,655 Updated Feb 26, 2025

LLaVA-VL / LLaVA-NeXT

Python 3,498 324 Updated Feb 24, 2025

jiwoon-ahn / irn

Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations, CVPR 2019 (Oral)

Python 531 98 Updated May 1, 2023

rulixiang / ToCo

[CVPR 2023] Token Contrast for Weakly-Supervised Semantic Segmentation

Python 159 12 Updated May 8, 2023

Ferenas / CPN

Complementary Patch for Weakly Supervised Semantic Segmentation, ICCV21 (poster)

Python 24 2 Updated Nov 8, 2021

bertrik / CameraWebServer

PlatformIO build of the ESP32 CameraWebServer code example

C++ 5 5 Updated Aug 14, 2021

SoyBeanMilkx / ESP32-Cam-StableDiffusion

The ESP32-CAM takes a photo and sends it to a computer, which runs Stable Diffusion to redraw it with AI.

C 4 Updated Feb 13, 2024

Qengineering / Jetson-Nano-Ubuntu-20-image

Jetson Nano with Ubuntu 20.04 image

794 82 Updated Jan 21, 2025

YunaiV / ruoyi-vue-pro

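🔥 Officially recommended 🔥 The all-new Pro version of RuoYi-Vue, with every feature optimized and refactored. A back-office management system plus WeChat mini program built on Spring Boot + MyBatis Plus + Vue & Element, supporting RBAC dynamic permissions, data permissions, SaaS multi-tenancy, Flowable workflows, third-party login, payments, SMS, e-commerce, CRM, ERP, AI large models, and more. Your ⭐️ Star ⭐️ is the author's motivation to keep growing hair!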

Java 29,508 6,388 Updated Feb 15, 2025

yfzhang114 / Generalization-Causality

Reading notes on a wide range of research on domain generalization, domain adaptation, causality, robustness, prompts, optimization, and generative models

1,192 102 Updated Dec 14, 2023

WZMIAOMIAO / deep-learning-for-image-processing

deep learning for image processing including classification and object-detection etc.

Python 24,160 8,118 Updated Jan 12, 2025

fundamentalvision / BEVFormer

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

Python 3,614 579 Updated Aug 15, 2024

ApolloAuto / apollo

An open autonomous driving platform

C++ 25,559 9,761 Updated Dec 31, 2024

initial-h / AlphaZero_Gomoku_MPI

An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku

Python 200 45 Updated Feb 28, 2025

HTLife / VINet

VINet: Visual-Inertial Odometry as a Sequence-to-Sequence Learning Problem

Python 210 65 Updated Jun 28, 2018

yun-liu / DEL

DEL: Deep Embedding Learning for Efficient Image Segmentation

C++ 63 14 Updated Dec 22, 2023

Wzhjerry / autoSMIM

[TMI' 23] autoSMIM: Automatic Superpixel-based Masked Image Modeling for Skin Lesion Segmentation

Python 20 2 Updated Jul 25, 2023

Lchengkun

Stars

facebookresearch / dinov2

pengzhiliang / MAE-pytorch

facebookresearch / ToMe

UMass-Embodied-AGI / Mod-Squad

Theia-4869 / FasterVLM

ictnlp / LLaVA-Mini