Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for Verified Robust Voice Conversion" (AAAI 2024)

Python 210 22 Updated Jul 31, 2024

Plachtaa / seed-vc

zero-shot voice conversion & singing voice conversion, with real-time support

Python 1,108 133 Updated Feb 18, 2025

state-spaces / mamba

Mamba SSM architecture

Python 14,055 1,225 Updated Jan 18, 2025

BakerBunker / FreeV

[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

Python 86 7 Updated Jul 4, 2024

hluwa / frida-dexdump

A frida tool to dump dex in memory to support security engineers analyzing malware.

Python 4,124 913 Updated Mar 4, 2023

QwenLM / Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,606 115 Updated Jul 5, 2024

CopyPlusPlus / CopyPlusPlus

让复制更加简单！

C# 918 62 Updated Feb 27, 2023

ddlBoJack / Speech-Resources

语音方向实验室/公司/资源/实习等，欢迎推荐或自荐

544 68 Updated Nov 13, 2024

Plachtaa / VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,804 774 Updated Feb 11, 2024

X-LANCE / SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Python 729 68 Updated Feb 9, 2025

CS-BAOYAN / CSSummerCamp2024

2024年计算机保研夏令营&冬令营通知

1,575 113 Updated Jul 18, 2024

lifeiteng / vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,091 323 Updated Nov 14, 2023

GitYCC / g2pW

Chinese Mandarin Grapheme-to-Phoneme Converter. 中文轉注音或拼音 (INTERSPEECH 2022)

Python 305 37 Updated Oct 20, 2024

MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Python 1,405 252 Updated Dec 2, 2024

G-Meteor / Forced-Alignment-MFA

Forced Alignment-MFA

37 1 Updated Jun 13, 2022

yuboona / some-script-to-help-using-Montreal-Forced-Aligner

Some script for helping using Montreal Forced Aligner, maily for transforming Hanzi character to pinyin and extrat pause time from .textgrid files.

Python 15 1 Updated Feb 9, 2024

resemble-ai / Resemblyzer

A python package to analyze and compare voices with deep learning

Python 2,861 440 Updated Oct 12, 2023

smtiitm / Fastspeech2_MFA

Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving quality of synthesis, as well as small foot print TTS integrat…

Perl 15 8 Updated Feb 9, 2024

yzhao062 / pyod

A Python Library for Outlier and Anomaly Detection, Integrating Classical and Deep Learning Techniques

Python 8,868 1,390 Updated Jan 13, 2025

Edresson / YourTTS

YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

Jupyter Notebook 944 82 Updated Nov 4, 2024

huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,675 5,687 Updated Feb 25, 2025

riffusion / riffusion-hobby

Stable diffusion for real-time music generation

Python 3,542 411 Updated Jul 22, 2024

LAION-AI / audio-dataset

Audio Dataset for training CLAP and other models

Python 666 55 Updated Feb 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bing0 Bing0cv

Block or report Bing0cv

Stars

NZqian / Freestyler

tencent-ailab / MuQ

ASLP-lab / OSUM

multimodal-art-projection / YuE

NZqian / RapBank

mir-aidj / all-in-one

kkksuper / kkcode

hayeong0 / DDDM-VC