Skip to content
View AlanLiudx's full-sized avatar

Block or report AlanLiudx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

12 Weeks, 24 Lessons, AI for All!

Jupyter Notebook 35,357 6,097 Updated Nov 11, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 37,595 4,270 Updated Dec 19, 2024

awesome image and video denoising, state of the art networks

609 110 Updated Apr 26, 2023

The Cone of Silence:

Python 151 22 Updated Apr 27, 2022
MATLAB 23 7 Updated May 14, 2020

Implementation of NN based mask estimator in pytorch

Python 31 14 Updated Dec 5, 2017

DDSP: Differentiable Digital Signal Processing

Python 2,935 345 Updated Sep 23, 2024

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python 2,573 664 Updated Dec 31, 2024

关于语音信号声源定位DOA估计所用的一些传统算法

MATLAB 394 84 Updated Jun 30, 2021

LaTeX Thesis Template for Tsinghua University

TeX 4,656 1,088 Updated Dec 30, 2024
Python 6 1 Updated Nov 2, 2022

End-to-End Speech Processing Toolkit

Python 8,636 2,199 Updated Dec 31, 2024

Several methods of generating phase-only Fresnel hologram for representing a multiple depth object.

MATLAB 9 1 Updated Nov 9, 2022

PyTorch implementation of some attentions for Deep Learning Researchers.

Python 1 Updated Mar 4, 2022

A collection of MATLAB routines for acoustical array processing on spherical harmonic signals, commonly captured with a spherical microphone array.

MATLAB 168 64 Updated Jan 24, 2024

This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for Single-channel Speech Enhancement", which was accepted by El…

Python 65 8 Updated Feb 10, 2022

an open-source implementation of sequence-to-sequence based speech processing engine

C++ 961 197 Updated Dec 2, 2022

Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)

Python 54 2 Updated Mar 12, 2024

Generator for anechoic, non-stationary noise signals

Python 11 1 Updated Aug 12, 2022

Causality Check in Frame-online Speech Separation

Python 43 3 Updated Dec 11, 2022

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

Python 247 56 Updated Apr 23, 2024

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,334 1,865 Updated Dec 31, 2024
Python 179 27 Updated Dec 4, 2023

Noise supression using deep filtering

Python 2,625 244 Updated Oct 17, 2024

Figure sizes, font sizes, fonts, and more configurations at minimal overhead. Fix your journal papers, conference proceedings, and other scientific publications.

Python 678 25 Updated Oct 17, 2024

This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods.

Python 36 9 Updated Mar 12, 2024

A PyTorch Lightning extension that accelerates and enhances foundation model experimentation with flexible fine-tuning schedules.

Python 60 4 Updated Dec 21, 2024
Python 19 2 Updated Jun 30, 2023
Python 3 Updated Jul 2, 2022
Next