- Tencent
- Shanghai
Stars
The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", accepted by Information Fusion.
Official Jax Implementation of MaskGIT
Real-Time De-noising and De-reverbing with Tiny Recurrent UNet
The personal information dashboard for your terminal
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
The official PyTorch implementation of the Interspeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"
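Free downloads of English-language magazines, including The Economist (with audio), The New Yorker, The Guardian, Wired, and The Atlantic, in epub, mobi, and pdf formats; updated weekly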
Speaker verification evaluation protocols simulating speaker diarisation
BAE-Net: A Low Complexity and High Fidelity Bandwidth-Adaptive Neural Network for Speech Super-Resolution
Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL)
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement
This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)
vits2 backbone with multilingual-bert
Voice activity detection (VAD) library, based on WebRTC's VAD engine
A list of publicly available room impulse response datasets and scripts to download them.
wsj0-{2, 3, 4, 5} mix generation scripts, in Python.
Official implementation of "Separate Anything You Describe"
Generating sensor signals in isotropic noise fields
For students who would like to apply for RA, PhD, or postdoc positions in audio research.
Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs