nanless

38 followers · 433 following

Stars

speaker recognition

26 repositories

Snowdar / asv-subtools

Open Source Tools for Speaker Recognition

Python · 612 stars · 130 forks · Updated Aug 5, 2024

clovaai / voxceleb_trainer

In defence of metric learning for speaker recognition

Python · 1,084 stars · 275 forks · Updated Mar 26, 2024

taylorlu / Speaker-Diarization

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Python · 479 stars · 120 forks · Updated Jul 1, 2021

yeyupiaoling / VoiceprintRecognition-Pytorch

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++. More models may be supported in the future. At the …

Python · 904 stars · 134 forks · Updated Feb 20, 2025

IDRnD / VoxTube

The VoxTube dataset official repository

HTML · 68 stars · 1 fork · Updated Feb 14, 2024

Jungjee / RawNet

Official repository for RawNet, RawNet2, and RawNet3

Python · 370 stars · 54 forks · Updated Mar 21, 2024

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python · 20,801 stars · 2,597 forks · Updated Feb 6, 2025

WeberJulian / AI-voice-chat

JavaScript · 172 stars · 19 forks · Updated Dec 1, 2023

lobehub / lobe-chat

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge managemen…

TypeScript · 56,367 stars · 12,029 forks · Updated Feb 26, 2025

pyannote / pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook · 6,922 stars · 835 forks · Updated Feb 24, 2025

KeSpeech / KeSpeech

This repo provides information about the KeSpeech dataset.

135 stars · 10 forks · Updated Oct 13, 2022

SpeechColab / GigaSpeech

Large, modern dataset for speech recognition

Shell · 663 stars · 62 forks · Updated Feb 26, 2024

philipperemy / deep-speaker

Deep Speaker: an End-to-End Neural Speaker Embedding System.

Python · 919 stars · 241 forks · Updated Apr 13, 2024

flashlight / wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

C++ · 6,408 stars · 1,012 forks · Updated Nov 23, 2024

facebookresearch / textlesslib

Library for Textless Spoken Language Processing

Python · 532 stars · 52 forks · Updated Aug 29, 2023

yinruiqing / pyannote-whisper

Python · 563 stars · 97 forks · Updated May 11, 2024

microsoft / NOTSOFAR1-Challenge

NOTSOFAR-1 Challenge: Distant Diarization and ASR

Python · 50 stars · 12 forks · Updated Feb 12, 2025

Nutlope / notesGPT

Record voice notes, then transcribe and summarize them and get tasks

TypeScript · 1,851 stars · 306 forks · Updated Feb 11, 2025

yeyupiaoling / VoiceprintRecognition-PaddlePaddle

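This project uses several advanced voiceprint recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and also supports multiple data preprocessing methods such as MelSpectrogram, Spectrogram, MFCC, and Fbank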

Python · 255 stars · 47 forks · Updated Feb 20, 2025

dmlguq456 / NeXt_TDNN_ASV

Official repository of NeXt-TDNN for speaker verification

Python · 65 stars · 7 forks · Updated Oct 10, 2024

skit-ai / SpeechLLM

This repository contains the training, inference, and evaluation code for SpeechLLM models, along with details about the model releases on Hugging Face.

Python · 84 stars · 7 forks · Updated Jun 25, 2024

ConsistencyVC / ConsistencyVC-voive-conversion

Jointly training a speaker encoder with a consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Python · 141 stars · 21 forks · Updated Oct 16, 2023

IDRnD / redimnet

The official PyTorch implementation of the Interspeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"

Python · 140 stars · 8 forks · Updated Nov 14, 2024

revdotcom / reverb

Open source inference code for Rev's model

Python · 377 stars · 25 forks · Updated Jan 17, 2025

VoxBlink2 / ScriptsForVoxBlink2

Official repository for VoxBlink2

Python · 62 stars · 4 forks · Updated Aug 13, 2024

vivian556123 / slt2024-ddtse

ddtse demo for slt2024

HTML · 3 stars · Updated Oct 7, 2024