Skip to content
View CaA23187's full-sized avatar
  • Institute of Acoustics, Chinese Academy of Sciences
  • Beijing

Block or report CaA23187

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
72 stars written in Python
Clear filter

Stable Diffusion web UI

Python 146,709 27,464 Updated Jan 30, 2025

Robust Speech Recognition via Large-Scale Weak Supervision

Python 75,431 9,021 Updated Jan 4, 2025

《动手学深度学习》:面向中文读者、能运行、可讨论。中英文版被70多个国家的500多所大学用于教学。

Python 65,423 11,215 Updated Jul 30, 2024

A generative speech model for daily dialogue.

Python 34,042 3,688 Updated Jan 25, 2025

12306智能刷票,订票

Python 34,007 9,795 Updated Apr 2, 2023

SoftVC VITS Singing Voice Conversion

Python 26,422 4,901 Updated Nov 11, 2023

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,657 2,589 Updated Jan 7, 2025

GUI for a Vocal Remover that uses Deep Neural Networks.

Python 19,255 1,423 Updated Dec 9, 2024

Fast and memory-efficient exact attention

Python 15,259 1,441 Updated Jan 30, 2025

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Python 14,568 1,516 Updated Jan 21, 2025

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 9,902 726 Updated Dec 4, 2024

A PyTorch-based Speech Toolkit

Python 9,289 1,426 Updated Feb 1, 2025

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 8,596 1,113 Updated Apr 24, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,405 640 Updated Jan 23, 2025

Official Matplotlib cheat sheets

Python 7,393 900 Updated Dec 11, 2024

Flops counter for convolutional networks in pytorch framework

Python 2,854 307 Updated Jan 20, 2025

Noise supression using deep filtering

Python 2,704 251 Updated Oct 17, 2024

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,540 227 Updated Dec 9, 2024

[IEEE TMI] Official Implementation for UNet++

Python 2,362 546 Updated Jan 11, 2025

mobilenetv3 with pytorch,provide pre-train model

Python 1,681 342 Updated Apr 27, 2023

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,504 437 Updated Jan 3, 2025

Awesome Lists for Tenure-Track Assistant Professors and PhD students. (助理教授/博士生生存指南)

Python 1,497 87 Updated Feb 1, 2024

SincNet is a neural architecture for efficiently processing raw audio samples.

Python 1,147 263 Updated Apr 28, 2021
Python 992 309 Updated Jan 27, 2025

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Python 989 180 Updated Dec 22, 2023

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 936 161 Updated Jul 5, 2023

A pytorch quantization backend for optimum

Python 870 68 Updated Jan 10, 2025

[CVPR 2022] Official implementation of the paper "Uformer: A General U-Shaped Transformer for Image Restoration".

Python 827 118 Updated Oct 24, 2024

Fast Diffusion Models with Transformers

Python 778 102 Updated Oct 25, 2024
Next