Highlights
Stars
A way to experience modded websites and you can install newer apps without fighting with Tizen Studio
Improve your security and privacy by blocking ads, tracking and malware domains.
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
Metadata and versioning details for the Common Voice dataset
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
A feature-rich command-line audio/video downloader
A librosa STFT/Fbank/mfcc feature extration written up in PyTorch using 1D Convolutions.
This is a pytorch implementation of k-means clustering algorithm
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
A parser, editor and profiler tool for ONNX models.
Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
Large scale K-means and K-nn implementation on NVIDIA GPU / CUDA
A pytorch quantization backend for optimum
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
UniSpeech - Large Scale Self-Supervised Learning for Speech
21 Lessons, Get Started Building with Generative AI đź”— https://microsoft.github.io/generative-ai-for-beginners/
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming…
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and test)
A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"
CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram