Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 43,554 7,766 Updated Oct 18, 2024

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 38,740 4,337 Updated Oct 18, 2024

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 36,806 5,821 Updated Aug 19, 2024

babysor / MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 35,143 5,207 Updated Oct 16, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 34,764 4,223 Updated Aug 16, 2024

hiroi-sora / Umi-OCR

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。

Python 26,566 2,669 Updated Oct 18, 2024

JaidedAI / EasyOCR

Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.

Python 24,162 3,141 Updated Sep 24, 2024

deepinsight / insightface

State-of-the-art 2D and 3D Face Analysis Project

Python 23,166 5,387 Updated Oct 11, 2024

fauxpilot / fauxpilot

FauxPilot - an open-source alternative to GitHub Copilot server

Python 14,575 621 Updated Apr 9, 2024

mementum / backtrader

Python Backtesting library for trading strategies

Python 14,278 3,902 Updated Aug 19, 2024

serengil / deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Python 12,981 2,105 Updated Oct 18, 2024

VikParuchuri / surya

OCR, layout analysis, reading order, table recognition in 90+ languages

Python 12,601 798 Updated Oct 18, 2024

chidiwilliams / buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

Python 12,308 931 Updated Oct 19, 2024

Embedding / Chinese-Word-Vectors

100+ Chinese Word Vectors 上百种预训练中文词向量

Python 11,807 2,316 Updated Oct 30, 2023

Rudrabha / Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python 10,571 2,260 Updated Sep 24, 2024

ymcui / Chinese-BERT-wwm

Pre-Training with Whole Word Masking for Chinese BERT（中文BERT-wwm系列模型）

Python 9,626 1,386 Updated Jul 31, 2023

MVIG-SJTU / AlphaPose

Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System

Python 7,985 1,969 Updated May 13, 2024

nl8590687 / ASRT_SpeechRecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Python 7,805 1,892 Updated Sep 26, 2024

openai / jukebox

Code for the paper "Jukebox: A Generative Model for Music"

Python 7,800 1,402 Updated Jun 19, 2024

Morizeyao / GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

Python 7,455 1,702 Updated Apr 25, 2024

PantsuDango / Dango-Translator

团子翻译器 —— 个人兴趣制作的一款基于OCR技术的翻译器

Python 7,026 523 Updated Oct 2, 2024

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 6,786 1,236 Updated Dec 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

qq90627

Block or report qq90627

Starred repositories

AUTOMATIC1111 / stable-diffusion-webui

huggingface / transformers

yt-dlp / yt-dlp

tensorflow / models

openai / whisper

meta-llama / llama

CorentinJ / Real-Time-Voice-Cloning

deepfakes / faceswap

PaddlePaddle / PaddleOCR