Starred repositories
Command-line program to download videos from YouTube.com and other video sites
A feature-rich command-line audio/video downloader
Robust Speech Recognition via Large-Scale Weak Supervision
The world's simplest facial recognition api for Python and the command line
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
A Gradio web UI for Large Language Models with support for multiple inference backends.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
Free and Open Source Enterprise Resource Planning (ERP)
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Universal LLM Deployment Engine with ML Compilation
Mail-in-a-Box helps individuals take back control of their email by defining a one-click, easy-to-deploy SMTP+everything else server: a mail server in a box.
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Script for downloading Coursera.org videos and naming them.
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
Scan, index, and archive all of your paper documents
Home of StarCoder: fine-tuning & inference!
An open source library for deep learning end-to-end dialog systems and chatbots.
SD.Next: All-in-one for AI generative image
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
A library to manipulate font files from Python.
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
A web frontend for the motion daemon.
Pretrained language model with 100B parameters