Starred repositories
Stable Diffusion web UI
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
A feature-rich command-line audio/video downloader
Models and examples built with TensorFlow
Robust Speech Recognition via Large-Scale Weak Supervision
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
Making large AI models cheaper, faster and more accessible
The simplest, fastest repository for training/finetuning medium-sized GPTs.
🚀 AI voice mimicry: clone your voice in 5 seconds and generate arbitrary speech in real-time
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
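Free, open-source, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Includes built-in multilingual language libraries.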
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.
State-of-the-art 2D and 3D Face Analysis Project
FauxPilot - an open-source alternative to GitHub Copilot server
Python Backtesting library for trading strategies
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
OCR, layout analysis, reading order, table recognition in 90+ languages
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
100+ pre-trained Chinese word vectors
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, please try out Sync Labs
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)
Real-Time and Accurate Full-Body Multi-Person Pose Estimation & Tracking System
A Deep-Learning-Based Chinese Speech Recognition System
Code for the paper "Jukebox: A Generative Model for Music"
Chinese version of GPT2 training code, using BERT tokenizer.
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech