Stars
12 Weeks, 24 Lessons, AI for All!
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
awesome image and video denoising, state of the art networks
Implementation of NN based mask estimator in pytorch
Data manipulation and transformation for audio signal processing, powered by PyTorch
关于语音信号声源定位DOA估计所用的一些传统算法
Several methods of generating phase-only Fresnel hologram for representing a multiple depth object.
AlanLiudx / attentions
Forked from sooftware/attentionsPyTorch implementation of some attentions for Deep Learning Researchers.
A collection of MATLAB routines for acoustical array processing on spherical harmonic signals, commonly captured with a spherical microphone array.
This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement", which was accepted by El…
an open-source implementation of sequence-to-sequence based speech processing engine
Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)
Generator for anechoic, non-stationary noise signals
Causality Check in Frame-online Speech Separation
The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
Noise supression using deep filtering
Figure sizes, font sizes, fonts, and more configurations at minimal overhead. Fix your journal papers, conference proceedings, and other scientific publications.
This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods.
A PyTorch Lightning extension that accelerates and enhances foundation model experimentation with flexible fine-tuning schedules.