Skip to content
View Lhx94As's full-sized avatar
💭
I may be slow to respond.
💭
I may be slow to respond.

Block or report Lhx94As

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
129 results for source starred repositories
Clear filter
Python 8 2 Updated Jul 15, 2024
Python 85 8 Updated Jul 6, 2024

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Python 599 80 Updated Dec 27, 2023

Code for paper "Noise-aware Speech Enhancement using Diffusion Probabilistic Model"

Python 84 2 Updated Jun 10, 2024
Python 26 1 Updated Jul 1, 2024

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,238 117 Updated Jul 11, 2024

Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.

Jupyter Notebook 73 4 Updated Oct 18, 2023

深度学习经典、新论文逐段精读

27,582 2,468 Updated Nov 17, 2024

强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 9,779 1,899 Updated Nov 8, 2024

[TPAMI 2024 & CVPR 2023] PyTorch code for DGM4: Detecting and Grounding Multi-Modal Media Manipulation and beyond

Python 385 29 Updated Apr 23, 2024

Variational Bayes HMM over x-vectors diarization

Python 257 57 Updated Jan 15, 2024

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,070 868 Updated Jul 6, 2024

ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS

Shell 27 1 Updated Mar 16, 2023

Python toolkit for speech processing

Python 68 21 Updated Nov 20, 2024

A torch implementation of a recursion which turns out to be useful for RNN-T.

Python 139 22 Updated Aug 25, 2023

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,291 8,749 Updated Dec 1, 2024

A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.

66 15 Updated Sep 9, 2019

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 12,781 878 Updated Oct 3, 2024

A PyTorch-based Speech Toolkit

Python 9,102 1,413 Updated Dec 20, 2024

This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1

Python 110 32 Updated May 22, 2019

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,547 310 Updated Jan 4, 2024

Source code for: Efficient Self-supervised Learning Representations for Spoken Language Identification

Python 4 Updated Sep 13, 2022

[NeurIPS 2023] Unsupervised Video Domain Adaptation for Action Recognition: A Disentanglement Perspective

Jupyter Notebook 122 11 Updated Oct 25, 2023
Python 5 2 Updated Nov 23, 2021

Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家

Python 43 14 Updated May 7, 2024

LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解,记录自己的leetcode解题之路。)

JavaScript 54,869 9,472 Updated Dec 10, 2024

PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification

Python 19 2 Updated Aug 24, 2023

End-to-End Speech Processing Toolkit

Python 8,619 2,198 Updated Dec 23, 2024
Next