speaker recognition
Open-source tools for speaker recognition
In defence of metric learning for speaker recognition
Speaker diarization with uis-rnn and speaker embeddings from vgg-speaker-recognition
This project uses a variety of advanced voiceprint recognition models, such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++. More models may be supported in the future. At the …
Official repository for RawNet, RawNet2, and RawNet3
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge managemen…
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
This repo provides information about the KeSpeech dataset.
Large, modern dataset for speech recognition
Deep Speaker: an End-to-End Neural Speaker Embedding System.
Facebook AI Research's Automatic Speech Recognition Toolkit
Library for Textless Spoken Language Processing
NOTSOFAR-1 Challenge: Distant Diarization and ASR
Record voice notes & transcribe, summarize, and get tasks
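This project uses several advanced voiceprint recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and also supports multiple data preprocessing methods such as MelSpectrogram, Spectrogram, MFCC, and Fbank.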
Official repository of NeXt-TDNN for speaker verification
This repository contains the training, inference, and evaluation code for SpeechLLM models, along with details about the model releases on Hugging Face.
Uses a jointly trained speaker encoder with a consistency loss to achieve cross-lingual and expressive voice conversion
The official PyTorch implementation of the Interspeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"