Stars
Faster Whisper transcription with CTranslate2
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
A python package to analyze and compare voices with deep learning
Sublime Text 2 setup used in the Ruby on Rails Tutorial
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.
🔉 👦 👧Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)
Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic features.
Vietnamese tokenizer (Maximum Matching and CRF)
SegEval Segmentation Evaluation Package