Stars
Python library for extracting chords from multiple sound file formats
A simple screen parsing tool towards pure vision based GUI agent
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
Implementation of the proposed minGRU in Pytorch
This is the official implementation of the LiSenNet
Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec…
Port of Funasr's Sense-voice model in C/C++
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
该代码与B站上的视频 https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7 相关联。
This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamics"
ESC-50: Dataset for Environmental Sound Classification
On-device noise suppression powered by deep learning
Keep track of good articles on speech processing, mainly on speech enhancement, include speech denoise, speech dereverberation and aec、agc, etc.
A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]
Official implementation of "Separate Anything You Describe"
Real-time microphone noise suppression on Linux.
This is the CoNNear human auditory periphery model that simulates cochlear, IHC and AN processing across the human hearing range.
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
A curated list of neural network pruning resources.
On-device AI across mobile, embedded and edge for PyTorch