Skip to content
View Eastforward's full-sized avatar

Block or report Eastforward

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching

Jupyter Notebook 796 100 Updated Dec 24, 2024

AudioBench: A Universal Benchmark for Audio Large Language Models

Python 106 1 Updated Dec 14, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 21,225 2,186 Updated Nov 11, 2024

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 1,876 132 Updated Dec 25, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,691 4,053 Updated Jul 17, 2024

Official repository of SepReformer for speech separation

Python 160 14 Updated Dec 18, 2024

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

Jupyter Notebook 166 13 Updated Mar 25, 2024

ASCII generator (image to text, image to image, video to video)

Python 7,528 572 Updated Nov 22, 2024

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,733 6,433 Updated Oct 18, 2024

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Python 711 119 Updated Dec 1, 2024

SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)

Python 61 4 Updated Dec 4, 2024

This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.

Python 94 13 Updated May 9, 2023
Python 40 5 Updated Dec 2, 2024

Perceptual Quality Estimator for speech and audio

C++ 719 127 Updated Aug 2, 2024

clash-for-linux

Shell 1,709 588 Updated Dec 12, 2023

This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022

Python 141 8 Updated Oct 11, 2023

The project uses Python to implement the PointNet training process, while leveraging GPU acceleration, C++, and CUDA for efficient inference.

Cuda 2 Updated Nov 13, 2024

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 546 76 Updated Dec 21, 2024

BLSP: Bootstrapping Langauge-Speech Pre-training via Behavior Alignment of Continuation Writing

Python 46 10 Updated Mar 11, 2024

一个简单的适用于拓竹的自动换色系统

C++ 207 49 Updated Nov 27, 2024

The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", which is accepted by Information Fusion.

Python 42 5 Updated Oct 17, 2024
Python 7,054 550 Updated Dec 20, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,274 840 Updated Dec 26, 2024

✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM

Python 1,080 64 Updated Dec 27, 2024

Analytic Class Incremental Learning for Sound Source Localization with Privacy Protection

Python 1 Updated Sep 14, 2024

爬取雨课堂答案

Python 11 Updated Nov 21, 2024

基于Python的雨课堂线上课划水小助手

Python 59 9 Updated Sep 1, 2022

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

5,926 324 Updated Dec 27, 2024

PointNet and PointNet++ implemented by pytorch (pure python) and on ModelNet, ShapeNet and S3DIS.

Python 3,812 917 Updated Apr 24, 2024

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,793 324 Updated Jul 14, 2024
Next