alibabasglab

Shengkui Zhao alibabasglab

77 followers · 106 following

Alibaba Group
Singapore
00:01 (UTC +08:00)
[email protected]

Achievements

Lists (3)

Sort

Stars

vishwamartur / ClearerVoice-Studio

Forked from modelscope/ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 1 Updated Dec 12, 2024

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 1,999 143 Updated Jan 14, 2025

FunAudioLLM / InspireMusic

InspireMusic: A Unified Framework for Music, Song, Audio Generation.

Python 296 26 Updated Dec 27, 2024

alibabasglab / TAC

Forked from yluo42/TAC

transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.

Python 1 Updated Jun 15, 2021

alibabasglab / tts

Bilingual and Code-Switching Speech Synthesis

HTML 1 Updated May 24, 2020

alibabasglab / MS-SNSD

Forked from microsoft/MS-SNSD

The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number of speakers, noise types, and Speech to Noise Ratio (SNR) l…

HTML 1 Updated Apr 6, 2020

alibabasglab / vc

cross-lingual voice conversion

HTML 1 Updated May 24, 2020

alibabasglab / FLASH-pytorch

Forked from lucidrains/FLASH-pytorch

Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"

Python 1 Updated Sep 23, 2022

alibabasglab / fig_resources

2 Updated Jan 2, 2024

alibabasglab / speechbrain

Forked from speechbrain/speechbrain

A PyTorch-based Speech Toolkit

Python 1 Updated Feb 19, 2024

alibabasglab / cLDM-DCL

3 Updated Oct 15, 2024

alibabasglab / CosyVoice

Forked from FunAudioLLM/CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 1 Updated Nov 22, 2024

alibabasglab / ClearerVoice-Studio

Forked from modelscope/ClearerVoice-Studio

ClearVoice

Python 1 Updated Nov 27, 2024

0TemetNosce0 / Qt-widgets-app

qt widget app

C++ 16 11 Updated Jan 30, 2019

docker / compose

Define and run multi-container applications with Docker

Go 34,443 5,281 Updated Jan 14, 2025

cameronmaske / celery-once

Celery Once allows you to prevent multiple execution and queuing of celery tasks.

Python 667 93 Updated Aug 29, 2023

haozh7109 / ChatGPT_voice-interaction

Turning Chat-GPT into a smart voice assistant based on speech recognition and text to speech synthesis

Python 4 Updated Jan 22, 2023

haozh7109 / SEGAN-TensorFlow2

Speech Enhancement Generative Adversarial Network (SEGAN), implementation with TensorFlow 2.X

Python 4 Updated Mar 14, 2024

haozh7109 / Numerical-methods-for-solving-partial-differential-equations-project

The project of solving Partial Differential Equations by numerical methods (Finite Difference, Finite Element, etc. Implemented in Python, Hao Zhao)

Python 6 3 Updated Oct 9, 2020

modelscope / modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

Python 7,231 751 Updated Jan 14, 2025

huggingface / transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 137,577 27,567 Updated Jan 14, 2025

alibabasglab / GatedFormer

This is the repository for the speech enhancement model SyncFormer

9 Updated Nov 28, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,048 617 Updated Jan 2, 2025

alibabasglab / MossFormer

This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Jo…

89 8 Updated Nov 28, 2024

alibabasglab / MossFormer2

This is the audio sample repository for speech separation model "MossFormer2".

Python 118 9 Updated Nov 28, 2024

Yuan-ManX / ai-audio-datasets

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

586 42 Updated Nov 24, 2024

alibabasglab / D2Former

This repository contains the audio samples for "D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhan…

Python 36 6 Updated Sep 6, 2023

lucidrains / FLASH-pytorch

Implementation of the Transformer variant proposed in "Transformer Quality in Linear Time"

Python 357 24 Updated Sep 26, 2023

alibabasglab / FRCRN

137 12 Updated Nov 28, 2024

Shengkui Zhao alibabasglab

Lists (3)

🔮 Future ideas

✨ Inspiration

🚀 My stack

Stars