Stars
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
Fast and accurate automatic speech recognition (ASR) for edge devices
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
FSA/FST algorithms, differentiable, with PyTorch compatibility.
Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.
Official repository of SepReformer for speech separation
🔥LeetCode solutions in any programming language | Solutions in multiple programming languages for LeetCode, 剑指 Offer (Coding Interviews, 2nd Edition), and Cracking the Coding Interview (6th Edition)
In defence of metric learning for speaker recognition
Production First and Production Ready End-to-End Keyword Spotting Toolkit
Simple implementation of mean shift clustering in Python
🔈 Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
This repo summarizes tutorials, datasets, papers, code, and tools for the speech separation and speaker extraction tasks. Pull requests are welcome.
ty274 / rir-generator
Forked from ehabets/RIR-Generator
Room Impulse Response Generator
Pretrained PyTorch face detection (MTCNN) and facial recognition (InceptionResnet) models
papers about Face Detection; Face Alignment; Face Recognition && Face Identification && Face Verification && Face Representation; Face Reconstruction; Face Tracking; Face Super-Resolution && Face D…
Trained models for the face_recognition Python library
The world's simplest facial recognition API for Python and the command line
Deep learning face detection and recognition, implemented in PyTorch.