Stars
Python implementation of performance metrics in Loizou's Speech Enhancement book
Command line utility for forced alignment using Kaldi
Implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", accepted by Information Fusion.
MIMII Sound Anomaly Detection with AutoEncoders
PyTorch implementation of the paper "Semi-Supervised Acoustic Anomaly Detection via Contrastive Learning"
A spectro-temporal fusion feature, STgram, with MobileFaceNet for more stable anomalous sound detection
Paper for anomalous sound detection
A platform for the collaborative creation of open audio collections labeled by humans and based on Freesound content.
An infant cry audio corpus that's being built through the Donate-a-cry campaign - see http://donateacry.com
Recognition of baby cry audio signals
A lightweight, portable, pure C99 ONNX inference engine for embedded devices with hardware acceleration support.
Hidden Markov Models in Python, with a scikit-learn-like API
Implementation of Axial attention - attending to multi-dimensional data efficiently
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.
Extract phone numbers from an audio recording of the dial tones.
📞 Simulating telephone Dual-Tone Multi-Frequency (DTMF) signaling in MATLAB
Neural audio plugin trained on custom analog fuzz circuit design
Efficient neural networks for analog audio effect modeling