-
Alibaba Group
- Singapore
-
GatedFormer Public
This is the repository for the speech enhancement model SyncFormer
-
MossFormer Public
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Jo…
-
MossFormer2 Public
This is the audio sample repository for speech separation model "MossFormer2".
-
ClearerVoice-Studio Public
Forked from modelscope/ClearerVoice-Studio
ClearVoice
-
CosyVoice Public
Forked from FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
-
speechbrain Public
Forked from speechbrain/speechbrain
A PyTorch-based Speech Toolkit
-
D2Former Public
This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhan…
-
FLASH-pytorch Public
Forked from lucidrains/FLASH-pytorch
Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"
-
TAC Public
Forked from yluo42/TAC
Transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
-
MS-SNSD Public
Forked from microsoft/MS-SNSD
The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) l…