Stars
🚀 Kick-start your C++! A template for modern C++ projects using CMake, CI, code coverage, clang-format, reproducible dependency management and much more.
Ready-to-use SRT / WebRTC / RTSP / RTMP / LL-HLS media server and media proxy that allows to read, publish, proxy, record and playback video and audio streams.
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Strong and Open Vision Language Assistant for Mobile Devices
Reading list for research topics in multimodal machine learning
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合
Run tensorrt yolov5 on Jetson devices, supports yolov5s, yolov5m, yolov5l, yolov5x.
Build and install OpenCV for the NVIDIA Jetson AGX Xavier
BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models
A collaboration friendly studio for NeRFs
[ECCV 2024] Single Image to 3D Textured Mesh in 10 seconds with Convolutional Reconstruction Model.
Generate 3D objects conditioned on text or images
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
基于pytorch的ocr算法库,包括 psenet, pan, dbnet, sast , crnn
OneDiff: An out-of-the-box acceleration library for diffusion models.
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
This may be the simplest implement of DDPM. You can directly run Main.py to train the UNet on CIFAR-10 dataset and see the amazing process of denoising.
Minimal But Practical Image Classifier Pipline Using Pytorch, Finetune on ResNet18, Got 99% Accuracy on Own Small Datasets.
✨✨Latest Advances on Multimodal Large Language Models
Python Kalman filtering and optimal estimation library. Implements Kalman filter, particle filter, Extended Kalman filter, Unscented Kalman filter, g-h (alpha-beta), least squares, H Infinity, smoo…