Skip to content
View CaA23187's full-sized avatar
  • Institute of Acoustics, Chinese Academy of Sciences
  • Beijing

Block or report CaA23187

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Audio Large Language Models

Python 326 20 Updated Jan 15, 2025

This is the PyTorch implementation of the Universal Source Separation with Weakly labelled Data.

Python 341 19 Updated Sep 1, 2023

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Python 14,452 1,510 Updated Jan 21, 2025
Python 145 19 Updated Dec 5, 2024

Music repair method to convert lossy MP3 compressed music to lossless music.

Python 181 16 Updated Jan 7, 2025

[Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement

Python 35 1 Updated Dec 2, 2024

语音增强领域的相关数据仿真工具和方法汇总--持续更新

37 4 Updated Jul 11, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 74,895 8,942 Updated Jan 4, 2025

This repo hosts the code and models of "Masked Autoencoders that Listen".

Python 559 47 Updated Apr 5, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,332 630 Updated Jan 20, 2025

a python library for speech enhancement

Python 77 14 Updated Jun 26, 2024

The official implementation of GTCRN, an ultra-lite speech enhancement model.

Python 250 45 Updated Jan 1, 2025

Fast Diffusion Models with Transformers

Python 777 101 Updated Oct 25, 2024

A generative speech model for daily dialogue.

Python 33,838 3,669 Updated Jan 19, 2025

A pytorch quantization backend for optimum

Python 868 68 Updated Jan 10, 2025

Stable Diffusion web UI

Python 146,216 27,406 Updated Dec 28, 2024

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 9,888 721 Updated Dec 4, 2024

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

Python 157 29 Updated Jul 24, 2024

天涯 kkndme 神贴聊房价

18,903 3,841 Updated Aug 27, 2023

A pytorch model profiler with information about macs, energy and e.t.c

Python 13 Updated Feb 24, 2024

Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.

MATLAB 502 127 Updated Feb 17, 2022

Production first, nn-based on-device signal processing toolkit.

64 3 Updated May 30, 2023
Python 985 310 Updated Jan 21, 2025

GUI for a Vocal Remover that uses Deep Neural Networks.

Python 19,144 1,420 Updated Dec 9, 2024
Python 69 12 Updated Sep 6, 2022

SoftVC VITS Singing Voice Conversion

Python 26,375 4,899 Updated Nov 11, 2023

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,532 227 Updated Dec 9, 2024

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Jupyter Notebook 736 71 Updated Sep 25, 2024

Official Matplotlib cheat sheets

Python 7,393 899 Updated Dec 11, 2024
Next