Skip to content
View alibabasglab's full-sized avatar

Block or report alibabasglab

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 1 Updated Dec 12, 2024

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 1,999 143 Updated Jan 14, 2025

InspireMusic: A Unified Framework for Music, Song, Audio Generation.

Python 296 26 Updated Dec 27, 2024

transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.

Python 1 Updated Jun 15, 2021

Bilingual and Code-Switching Speech Synthesis

HTML 1 Updated May 24, 2020

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) l…

HTML 1 Updated Apr 6, 2020

cross-lingual voice conversion

HTML 1 Updated May 24, 2020

Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"

Python 1 Updated Sep 23, 2022

A PyTorch-based Speech Toolkit

Python 1 Updated Feb 19, 2024
3 Updated Oct 15, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 1 Updated Nov 22, 2024

ClearVoice

Python 1 Updated Nov 27, 2024

qt widget app

C++ 16 11 Updated Jan 30, 2019

Define and run multi-container applications with Docker

Go 34,443 5,281 Updated Jan 14, 2025

Celery Once allows you to prevent multiple execution and queuing of celery tasks.

Python 667 93 Updated Aug 29, 2023

Turning Chat-GPT into a smart voice assistant based on speech recognition and text to speech synthesis

Python 4 Updated Jan 22, 2023

Speech Enhancement Generative Adversarial Network (SEGAN), implementation with TensorFlow 2.X

Python 4 Updated Mar 14, 2024

The project of solving Partial Differential Equations by numerical methods (Finite Difference, Finite Element, etc. Implemented in Python, Hao Zhao)

Python 6 3 Updated Oct 9, 2020

ModelScope: bring the notion of Model-as-a-Service to life.

Python 7,231 751 Updated Jan 14, 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 137,577 27,567 Updated Jan 14, 2025

This is the repository for the speech enhancement model SyncFormer

9 Updated Nov 28, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,048 617 Updated Jan 2, 2025

This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Jo…

89 8 Updated Nov 28, 2024

This is the audio sample repository for speech separation model "MossFormer2".

Python 118 9 Updated Nov 28, 2024

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

586 42 Updated Nov 24, 2024

This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhan…

Python 36 6 Updated Sep 6, 2023

Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"

Python 357 24 Updated Sep 26, 2023