Eastforward

Eastforward

Welcome!

4 followers · 9 following

Stars

shivammehta25 / Matcha-TTS

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 796 100 Updated Dec 24, 2024

AudioLLMs / AudioBench

AudioBench: A Universal Benchmark for Audio Large Language Models

Python 106 1 Updated Dec 14, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 21,225 2,186 Updated Nov 11, 2024

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 1,876 132 Updated Dec 25, 2024

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,691 4,053 Updated Jul 17, 2024

dmlguq456 / SepReformer

Official repository of SepReformer for speech separation

Python 160 14 Updated Dec 18, 2024

happylittlecat2333 / Auffusion

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

Jupyter Notebook 166 13 Updated Mar 25, 2024

vietnh1009 / ASCII-generator

ASCII generator (image to text, image to image, video to video)

Python 7,528 572 Updated Nov 22, 2024

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,733 6,433 Updated Oct 18, 2024

gabrielmittag / NISQA

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Python 711 119 Updated Dec 1, 2024

alessandroragano / scoreq

SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)

Python 61 4 Updated Dec 4, 2024

facebookresearch / Noresqa

This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.

Python 94 13 Updated May 9, 2023

JasonSWFu / VQscore

Python 40 5 Updated Dec 2, 2024

google / visqol

Perceptual Quality Estimator for speech and audio

C++ 719 127 Updated Aug 2, 2024

wnlen / clash-for-linux

clash-for-linux

Shell 1,709 588 Updated Dec 12, 2023

liuxubo717 / LASS

This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022

Python 141 8 Updated Oct 11, 2023

Toki3ki / PointNet-PyTrain-CudaInfer

The project uses Python to implement the PointNet training process, while leveraging GPU acceleration, C++, and CUDA for efficient inference.

Cuda 2 Updated Nov 13, 2024

sp-uhh / sgmse

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 546 76 Updated Dec 21, 2024

cwang621 / blsp

BLSP: Bootstrapping Langauge-Speech Pre-training via Behavior Alignment of Continuation Writing

Python 46 10 Updated Mar 11, 2024

applenana / AP-AMS

一个简单的适用于拓竹的自动换色系统

C++ 207 49 Updated Nov 27, 2024

HaoFengyuan / X-TF-GridNet

The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", which is accepted by Information Fusion.

Python 42 5 Updated Oct 17, 2024