Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,488 434 Updated Dec 8, 2024

microsoft / DNS-Challenge

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Python 1,140 415 Updated Jul 25, 2024

yeyupiaoling / PPASR

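End-to-end Chinese speech recognition implemented with PaddlePaddle, from getting started to hands-on practice, with very simple introductory examples and highly practical enterprise projects. Supports the currently most popular DeepSpeech2, Conformer, and Squeezeformer models.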

Python 832 130 Updated Nov 25, 2024

Audio-WestlakeU / FullSubNet

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Python 558 158 Updated Aug 19, 2023

facebookresearch / AudioMAE

This repo hosts the code and models of "Masked Autoencoders that Listen".

Python 557 47 Updated Apr 5, 2024

zcablii / LSKNet

(IJCV2024 & ICCV2023) LSKNet: A Foundation Lightweight Backbone for Remote Sensing

Python 505 41 Updated Oct 7, 2024

hbrobotics / ros_arduino_bridge

ROS + Arduino = Robot

Python 357 353 Updated Apr 10, 2019

ruizhecao96 / CMGAN

Conformer-based Metric GAN for speech enhancement

Python 328 60 Updated May 3, 2024

naplab / Conv-TasNet

Python 290 70 Updated Feb 28, 2020

lucidrains / x-unet

Implementation of a U-net complete with efficient attention as well as the latest research findings

Python 271 20 Updated May 3, 2024

yluo42 / TAC

transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.

Python 263 54 Updated Jun 15, 2021

RookieJunChen / FullSubNet-plus

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

Python 247 55 Updated Apr 23, 2024

kssteven418 / Squeezeformer

[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

Python 245 19 Updated Feb 12, 2023

Yangruipis / ModelingPreparation

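Preparation work for mathematical modeling, including hand-written implementations of some algorithms and examples of calling them.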

Python 226 53 Updated Feb 25, 2018

tencent-ailab / FRA-RIR

Python 179 27 Updated Dec 4, 2023

morriswmz / doatools.py

A simple library for theoretical research on direction-of-arrival (DOA) estimation in array signal processing.

Python 166 48 Updated Jan 28, 2021

Jiaxin-Ye / TIM-Net_SER

[ICASSP 2023] Official TensorFlow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech Emotion Recognition".

Python 165 25 Updated May 15, 2024

tianbot / tianracer

A meta-package for tianbot autonomous AI racecar based on nvidia development kits.

Python 126 94 Updated Dec 9, 2024

cwx-worst-one / EAT

[IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer

Python 118 7 Updated Dec 23, 2024

yuguochencuc / DB-AIAT

The implementation of "Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement"

Python 115 20 Updated Jun 29, 2022

kaituoxu / TasNet

A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.

Python 112 31 Updated Jan 27, 2019

yinkalario / Two-Stage-Polyphonic-Sound-Event-Detection-and-Localization

A two-stage polyphonic sound event detection and localization method for both SED and DOA.

Python 110 26 Updated Jan 8, 2023

Audio-WestlakeU / RealMAN

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]

Python 105 11 Updated Dec 11, 2024

Audio-WestlakeU / FN-SSL

The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]

Python 97 10 Updated Dec 9, 2024

RookieJunChen / Inter-SubNet

The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.

Python 95 12 Updated May 24, 2023

yq1227

Stars

WZMIAOMIAO / deep-learning-for-image-processing

xmu-xiaoma666 / External-Attention-pytorch

espnet / espnet

ai-dawang / PlugNPlay-Modules

bigmb / Unet-Segmentation-Pytorch-Nest-of-Unets

LCAV / pyroomacoustics