-
Alibaba Group
- Singapore
-
00:01
(UTC +08:00) - [email protected]
Lists (3)
Sort Name ascending (A-Z)
Stars
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
alibabasglab / TAC
Forked from yluo42/TACtransform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
alibabasglab / MS-SNSD
Forked from microsoft/MS-SNSDThe Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) l…
alibabasglab / FLASH-pytorch
Forked from lucidrains/FLASH-pytorchImplementation of the Transformer variant proposed in "Transformer Quality in Linear Time"
alibabasglab / speechbrain
Forked from speechbrain/speechbrainA PyTorch-based Speech Toolkit
alibabasglab / CosyVoice
Forked from FunAudioLLM/CosyVoiceMulti-lingual large voice generation model, providing inference, training and deployment full-stack ability.
ClearVoice
Define and run multi-container applications with Docker
Celery Once allows you to prevent multiple execution and queuing of celery tasks.
Turning Chat-GPT into a smart voice assistant based on speech recognition and text to speech synthesis
Speech Enhancement Generative Adversarial Network (SEGAN), implementation with TensorFlow 2.X
The project of solving Partial Differential Equations by numerical methods (Finite Difference, Finite Element, etc. Implemented in Python, Hao Zhao)
ModelScope: bring the notion of Model-as-a-Service to life.
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
This is the repository for the speech enhancement model SyncFormer
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Jo…
This is the audio sample repository for speech separation model "MossFormer2".
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…
This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhan…
Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"