CaA23187

Follow

Kuang Kelan CaA23187

Follow

Speech Enhancement, Array Signal Processing. E-mail: [email protected]

24 followers · 35 following

Institute of Acoustics, Chinese Academy of Sciences
Beijing

Achievements

Achievements

Stars

AudioLLMs / Awesome-Audio-LLM

Audio Large Language Models

Python 326 20 Updated Jan 15, 2025

bytedance / uss

This is the PyTorch implementation of the Universal Source Separation with Weakly labelled Data.

Python 341 19 Updated Sep 1, 2023

Zeyi-Lin / HivisionIDPhotos

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Python 14,452 1,510 Updated Jan 21, 2025

JusperLee / SPMamba

Python 145 19 Updated Dec 5, 2024

JusperLee / Apollo

Music repair method to convert lossy MP3 compressed music to lossless music.

Python 181 16 Updated Jan 7, 2025

felixperfler / Stable-Hybrid-Auditory-Filterbanks

[Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement

Python 35 1 Updated Dec 2, 2024

YoungJay0612 / Speech-Simulation-Tools

语音增强领域的相关数据仿真工具和方法汇总--持续更新

37 4 Updated Jul 11, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 74,895 8,942 Updated Jan 4, 2025

facebookresearch / AudioMAE

This repo hosts the code and models of "Masked Autoencoders that Listen".

Python 559 47 Updated Apr 5, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,332 630 Updated Jan 20, 2025

chenwj1989 / python-speech-enhancement

a python library for speech enhancement

Python 77 14 Updated Jun 26, 2024

cszheng-ioa / Sixty-years-of-frequency-domain-monaural-speech-enhancement

Python 135 27 Updated Jan 30, 2024

Xiaobin-Rong / gtcrn

The official implementation of GTCRN, an ultra-lite speech enhancement model.

Python 250 45 Updated Jan 1, 2025

chuanyangjin / fast-DiT

Fast Diffusion Models with Transformers

Python 777 101 Updated Oct 25, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 33,838 3,669 Updated Jan 19, 2025

huggingface / optimum-quanto

A pytorch quantization backend for optimum

Python 868 68 Updated Jan 10, 2025

AUTOMATIC1111 / stable-diffusion-webui

Stable Diffusion web UI

Python 146,216 27,406 Updated Dec 28, 2024

cumulo-autumn / StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 9,888 721 Updated Dec 4, 2024

anton-jeran / FAST-RIR

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

Python 157 29 Updated Jul 24, 2024

shengcaishizhan / kkndme_tianya

天涯 kkndme 神贴聊房价

18,903 3,841 Updated Aug 27, 2023

HaoKang-Timmy / torchanalyse

A pytorch model profiler with information about macs, energy and e.t.c

Python 13 Updated Feb 24, 2024

anicolson / DeepXi

Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.

MATLAB 502 127 Updated Feb 17, 2022

wenet-e2e / wesignal

Production first, nn-based on-device signal processing toolkit.

64 3 Updated May 30, 2023

k2-fsa / icefall

Python 985 310 Updated Jan 21, 2025

Anjok07 / ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Python 19,144 1,420 Updated Dec 9, 2024

Qinwen-Hu / dparn

Python 69 12 Updated Sep 6, 2022

svc-develop-team / so-vits-svc

SoftVC VITS Singing Voice Conversion

Python 26,375 4,899 Updated Nov 11, 2023

haoheliu / AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,532 227 Updated Dec 9, 2024

teticio / audio-diffusion

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Jupyter Notebook 736 71 Updated Sep 25, 2024

matplotlib / cheatsheets

Official Matplotlib cheat sheets

Python 7,393 899 Updated Dec 11, 2024