Starred repositories
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
An open source implementation of CLIP.
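Sharing some useful Dify DSL workflows, suitable for both personal use and learning.
Real-time detection and high-precision reading recognition for pointer-type gauges using Nanodet + YoloV8-Pose (with the ncnn framework)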
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Make websites accessible for AI agents
A powerful OCR (Optical Character Recognition) package that uses state-of-the-art vision language models
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
China-Balanced-License-Plate-Recognition-Dataset-330k: A balanced dataset of 330,000 images featuring various types of Chinese license plates for recognition tasks, ideal for training and evaluating…
Git with a cup of tea! Painless self-hosted all-in-one software development service, including Git hosting, code review, team collaboration, package registry and CI/CD
Concise, consistent, and legible badges in SVG and raster format
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
License Plate Recognition For Car With Python And OpenCV
Detects license plate of car and recognizes its characters
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
YOLO5Face: Why Reinventing a Face Detector (https://arxiv.org/abs/2105.12931, ECCV Workshops 2022)
YOLOv3 in PyTorch > ONNX > CoreML > TFLite
Example of using ultralytics YOLO V5 with OpenCV 4.5.4, C++ and Python
Building a quick conversation-based search demo with Lepton AI.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
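Labeling tool with SAM (Segment Anything Model); supports SAM, SAM2, sam-hq, MobileSAM, EdgeSAM, etc. An interactive, semi-automatic image annotation tool.
Mask-wearing detection implemented with PyTorch and YOLOv4 ⭐ Sharing a self-built mask dataset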