Stars
The Harmonix Set: Beats, Downbeats, and Structural Annotations for Pop Music
Acoustic Echo Cancellation with Nerual Kalman Filtering
Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement
mirror of https://chromium.googlesource.com/external/webrtc
This Repostory contains the pretrained DTLN-aec model for real-time acoustic echo cancellation.
Official data preparation scripts for the URGENT 2024 Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)
razorenhua / AEC-Challenge
Forked from microsoft/AEC-ChallengeAEC Challenge
This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)
ShenYi666666 / DNS-Challenge
Forked from microsoft/DNS-ChallengeThis repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open sourcing clean speech and noise files as well. Participants of…
Noise supression using deep filtering
This repo hosts the code and models of "Masked Autoencoders that Listen".
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
Learning the Beauty in Songs: Neural Singing Voice Beautifier; ACL 2022 (Main conference); Official code
A timeline of the latest AI models for audio generation, starting in 2023!
Codes for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking system based on streaming Transformer
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
Implementation of the Wave-U-Net for audio source separation
Codes for ISMIR 2022 paper: Beat Transformer: Demixed Beat and Downbeat Tracking with Dilated Self-Attention
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal proce…
Ideal Ratio Mask (IRM) Estimation based Speech Enhancement using LSTM
Sound event localization, detection, and tracking of multiple overlapping and moving sources in 2D spherical space using convolutional recurrent neural network
A two-stage polyphonic sound event detection and localization method for both SED and DOA.