Skip to content
View alibabasglab's full-sized avatar

Block or report alibabasglab

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • GatedFormer Public

    This is the repository for the speech enhancement model SyncFormer

    9 MIT License Updated Nov 28, 2024
  • MossFormer Public

    This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Jo…

    89 8 Apache License 2.0 Updated Nov 28, 2024
  • FRCRN Public

    137 12 Updated Nov 28, 2024
  • MossFormer2 Public

    This is the audio sample repository for speech separation model "MossFormer2".

    Python 118 9 MIT License Updated Nov 28, 2024
  • ClearVoice

    Python 1 Apache License 2.0 Updated Nov 27, 2024
  • CosyVoice Public

    Forked from FunAudioLLM/CosyVoice

    Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

    Python 1 Apache License 2.0 Updated Nov 22, 2024
  • cLDM-DCL Public

    3 Apache License 2.0 Updated Oct 15, 2024
  • A PyTorch-based Speech Toolkit

    Python 1 Apache License 2.0 Updated Feb 19, 2024
  • 2 Updated Jan 2, 2024
  • D2Former Public

    This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhan…

    Python 36 6 MIT License Updated Sep 6, 2023
  • Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"

    Python 1 MIT License Updated Sep 23, 2022
  • TAC Public

    Forked from yluo42/TAC

    transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.

    Python 1 Updated Jun 15, 2021
  • tts Public

    Bilingual and Code-Switching Speech Synthesis

    HTML 1 MIT License Updated May 24, 2020
  • vc Public

    cross-lingual voice conversion

    HTML 1 MIT License Updated May 24, 2020
  • MS-SNSD Public

    Forked from microsoft/MS-SNSD

    The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) l…

    HTML 1 MIT License Updated Apr 6, 2020