Stars

speaker recognition

26 repositories

An open-source toolkit for speaker recognition

Python 612 130 Updated Aug 5, 2024

In defence of metric learning for speaker recognition

Python 1,084 275 Updated Mar 26, 2024

Speaker diarization with uis-rnn and speaker embeddings from vgg-speaker-recognition

Python 479 120 Updated Jul 1, 2021

This project uses a variety of advanced voiceprint recognition models, such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++. More models may be supported in the future. At the …

Python 904 134 Updated Feb 20, 2025

The VoxTube dataset official repository

HTML 68 1 Updated Feb 14, 2024

Official repository for RawNet, RawNet2, and RawNet3

Python 370 54 Updated Mar 21, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,801 2,597 Updated Feb 6, 2025

JavaScript 172 19 Updated Dec 1, 2023

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge managemen…

TypeScript 56,367 12,029 Updated Feb 26, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,922 835 Updated Feb 24, 2025

This repo provides information about the KeSpeech dataset.

135 10 Updated Oct 13, 2022

Large, modern dataset for speech recognition

Shell 663 62 Updated Feb 26, 2024

Deep Speaker: an End-to-End Neural Speaker Embedding System.

Python 919 241 Updated Apr 13, 2024

Facebook AI Research's Automatic Speech Recognition Toolkit

C++ 6,408 1,012 Updated Nov 23, 2024

Library for Textless Spoken Language Processing

Python 532 52 Updated Aug 29, 2023

NOTSOFAR-1 Challenge: Distant Diarization and ASR

Python 50 12 Updated Feb 12, 2025

Record voice notes & transcribe, summarize, and get tasks

TypeScript 1,851 306 Updated Feb 11, 2025

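This project uses several advanced voiceprint recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and also supports multiple data preprocessing methods, including MelSpectrogram, Spectrogram, MFCC, and Fbank.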

Python 255 47 Updated Feb 20, 2025

Official repository of NeXt-TDNN for speaker verification

Python 65 7 Updated Oct 10, 2024

This repository contains the training, inference, and evaluation code for SpeechLLM models, along with details about the model releases on Hugging Face.

Python 84 7 Updated Jun 25, 2024

Jointly training a speaker encoder with a consistency loss to achieve cross-lingual voice conversion and expressive voice conversion

Python 141 21 Updated Oct 16, 2023

The official PyTorch implementation of the Interspeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"

Python 140 8 Updated Nov 14, 2024

Open source inference code for Rev's model

Python 377 25 Updated Jan 17, 2025

Official Repository For VoxBlink2

Python 62 4 Updated Aug 13, 2024

ddtse demo for slt2024

HTML 3 Updated Oct 7, 2024