Polyphonic Sound Detection Score (PSDS)

Python 7 14 Updated Jan 20, 2020
TypeScript 166 92 Updated Jan 6, 2025

ModelScope: bring the notion of Model-as-a-Service to life.

Python 7,199 749 Updated Jan 7, 2025

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Python 2,077 322 Updated Nov 14, 2023

Code for the paper "Language Models are Unsupervised Multitask Learners"

Python 22,751 5,555 Updated Aug 14, 2024

AEC Challenge

392 130 Updated Jun 4, 2024

Acoustic Echo Cancellation with Neural Kalman Filtering

HTML 260 63 Updated Feb 21, 2023

This repo contains the official PyTorch implementation of "A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement" (Interspeech 2022)

Python 28 2 Updated Aug 8, 2022

A Python implementation of the Speech Intelligibility Index

Python 41 2 Updated Jul 13, 2023

Collection of audio-focused loss functions in PyTorch

Python 755 67 Updated Jul 30, 2024

Clarity Challenge toolkit - software for building Clarity Challenge systems

Python 142 54 Updated Jan 7, 2025

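"Princeling Relationship Network", compiled by 编程随想 (Program Think), dedicated to exposing the powerful elite of the "State of Zhao"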

Python 13,638 2,759 Updated Aug 1, 2021

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,945 8,837 Updated Jan 4, 2025

A Whiteboard editor for Standard Notes based on TLDRAW

SCSS 36 3 Updated Jan 11, 2023

Simple text to phones converter for multiple languages

Python 1,261 176 Updated Sep 26, 2024
Python 49 15 Updated Jun 4, 2022

Multi-Phase Gammatone Filterbank (MP-GTF) construction for Python

Python 46 7 Updated Apr 30, 2020

Articulation Band Correlation Modified Rhyme Test

MATLAB 4 Updated Jun 23, 2021

A Python library for working with Praat, TextGrids, time-aligned audio transcripts, and audio files. It is primarily used for extracting features from and making manipulations on audio files given …

Python 320 33 Updated Dec 10, 2023

Command line utility for forced alignment using Kaldi

Python 1,383 251 Updated Dec 2, 2024

General Speech Restoration

Python 276 56 Updated Jan 13, 2024
Python 36 9 Updated Jun 22, 2022

VIP cheatsheets for Stanford's CS 221 Artificial Intelligence

2,633 507 Updated Dec 17, 2019

Implementing Stand-Alone Self-Attention in Vision Models using PyTorch

Python 455 83 Updated Feb 13, 2020

Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.

MATLAB 502 127 Updated Feb 17, 2022

A PyTorch-based Speech Toolkit

Python 9,151 1,415 Updated Jan 7, 2025

PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡

Python 4,376 664 Updated Aug 16, 2024

Deep-Learning-Based Audio-Visual Speech Enhancement and Separation

205 22 Updated Apr 16, 2023

Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021

Jupyter Notebook 38 4 Updated Oct 14, 2021

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,773 6,444 Updated Oct 18, 2024