Lists (2)
Sort Name ascending (A-Z)
Starred repositories
Matlab files for various types of beamforming
This repo contains the ENF-WHU audio recording dataset collected around Wuhan University campus and the MATLAB programs for electronic network frequency (ENF) detection, enhancement, and robust est…
[c++]STFT, iSTFT, mel-filterbank, DFT, iDFT modules
The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".
Conditional Diffusion Probabilistic Model for Speech Enhancement
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…
BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
Spherical Microphone array Impulse Response generator (SMIRgen)
Code repository for the paper Robust Sound Source Tracking Using SRP-PHAT and 3D Convolutional Neural Networks
singing voice change based on whisper, and lora for singing voice clone
Finetuning VITS Efficiently
The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
Instruct-tune LLaMA on consumer hardware
基于PyTorch的VITS-BigVGAN的tts中文模型,加入韵律预测模型。
UniSpeech - Large Scale Self-Supervised Learning for Speech
An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
kuhung / bert_finetune
Forked from google-research/bert中文语料 Bert finetune(Fine-tune Chinese for BERT)
State-of-the-Art Text Embeddings
You can find the speech algorithms you want here
How to use our public wav2vec2 dimensional emotion model
Controllable and fast Text-to-Speech for over 7000 languages!