Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,405 640 Updated Jan 23, 2025

matplotlib / cheatsheets

Official Matplotlib cheat sheets

Python 7,393 900 Updated Dec 11, 2024

sovrasov / flops-counter.pytorch

Flops counter for convolutional networks in pytorch framework

Python 2,854 307 Updated Jan 20, 2025

Rikorose / DeepFilterNet

Noise supression using deep filtering

Python 2,704 251 Updated Oct 17, 2024

haoheliu / AudioLDM

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,540 227 Updated Dec 9, 2024

MrGiovanni / UNetPlusPlus

[IEEE TMI] Official Implementation for UNet++

Python 2,362 546 Updated Jan 11, 2025

xiaolai-sqlai / mobilenetv3

mobilenetv3 with pytorch，provide pre-train model

Python 1,681 342 Updated Apr 27, 2023

LCAV / pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,504 437 Updated Jan 3, 2025

JunweiLiang / awesome_lists

Awesome Lists for Tenure-Track Assistant Professors and PhD students. (助理教授/博士生生存指南)

Python 1,497 87 Updated Feb 1, 2024

mravanelli / SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Python 1,147 263 Updated Apr 28, 2021

k2-fsa / icefall

Python 992 309 Updated Jan 27, 2025

sooftware / conformer

[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)

Python 989 180 Updated Dec 22, 2023

aliutkus / speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 936 161 Updated Jul 5, 2023

huggingface / optimum-quanto

A pytorch quantization backend for optimum

Python 870 68 Updated Jan 10, 2025

ZhendongWang6 / Uformer

[CVPR 2022] Official implementation of the paper "Uformer: A General U-Shaped Transformer for Image Restoration".

Python 827 118 Updated Oct 24, 2024

chuanyangjin / fast-DiT

Fast Diffusion Models with Transformers

Python 778 102 Updated Oct 25, 2024

google-research / sound-separation

Python 659 118 Updated Oct 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kuang Kelan CaA23187

Achievements

Achievements

Block or report CaA23187

Stars

AUTOMATIC1111 / stable-diffusion-webui

openai / whisper

d2l-ai / d2l-zh

2noise / ChatTTS

testerSunshine / 12306

svc-develop-team / so-vits-svc

microsoft / unilm

Anjok07 / ultimatevocalremovergui

Dao-AILab / flash-attention

Zeyi-Lin / HivisionIDPhotos

cumulo-autumn / StreamDiffusion

speechbrain / speechbrain

facebookresearch / demucs

open-mmlab / Amphion