Stars
ModelScope: bring the notion of Model-as-a-Service to life.
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
Code for the paper "Language Models are Unsupervised Multitask Learners"
Acoustic Echo Cancellation with Nerual Kalman Filtering
This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (Interspeech 2022)
A Python implementation of the Speech Intelligibility Index
Collection of audio-focused loss functions in PyTorch
Clarity Challenge toolkit - software for building Clarity Challenge systems
Robust Speech Recognition via Large-Scale Weak Supervision
A Whiteboard editor for Standard Notes based on TLDRAW
Simple text to phones converter for multiple languages
Multi-Phase Gammatone Filterbank (MP-GTF) construction for Python
A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting features from and making manipulations on audio files given …
Command line utility for forced alignment using Kaldi
VIP cheatsheets for Stanford's CS 221 Artificial Intelligence
Implementing Stand-Alone Self-Attention in Vision Models using Pytorch
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.