Lhx94As

💭

I may be slow to respond.

Hexin Liu Lhx94As

💭

I may be slow to respond.

NTU Postdoc in Speech lab@NTU Language Identification & Multilingual/Code-switching ASR

35 followers · 20 following

Nanyang Technological University
Singapore
https://scholar.google.com/citations?user=iAT_5-kAAAAJ&hl=en

Achievements

Stars

129 results for source starred repositories

Clear filter

kaistmm / FlowAVSE

Python 8 2 Updated Jul 15, 2024

Tonyyouyou / Mutual-Information-Analysis

Python 3 Updated Sep 14, 2024

wutaiqiang / MoSLoRA

Python 85 8 Updated Jul 6, 2024

yangdongchao / AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Python 599 80 Updated Dec 27, 2023

YUCHEN005 / NASE

Code for paper "Noise-aware Speech Enhancement using Diffusion Probabilistic Model"

Python 84 2 Updated Jun 10, 2024

Tonyyouyou / Mamba-in-Speech

Python 26 1 Updated Jul 1, 2024

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,238 117 Updated Jul 11, 2024

chq1155 / A-Survey-on-Generative-Diffusion-Model

924 60 Updated Oct 18, 2023

FrenchKrab / IS2023-powerset-diarization

Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.

Jupyter Notebook 73 4 Updated Oct 18, 2023

mli / paper-reading

深度学习经典、新论文逐段精读

27,582 2,468 Updated Nov 17, 2024

datawhalechina / easy-rl

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 9,779 1,899 Updated Nov 8, 2024

rshaojimmy / MultiModal-DeepFake

[TPAMI 2024 & CVPR 2023] PyTorch code for DGM4: Detecting and Grounding Multi-Modal Media Manipulation and beyond

Python 385 29 Updated Apr 23, 2024

BUTSpeechFIT / VBx

Variational Bayes HMM over x-vectors diarization

Python 257 57 Updated Jan 15, 2024

AIGC-Audio / AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,070 868 Updated Jul 6, 2024

HuangZiliAndy / SSL_for_multitalker

ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS

Shell 27 1 Updated Mar 16, 2023

hyperion-ml / hyperion

Python toolkit for speech processing

Python 68 21 Updated Nov 20, 2024

k2-fsa / fast_rnnt

A torch implementation of a recursion which turns out to be useful for RNN-T.

Python 139 22 Updated Aug 25, 2023

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,291 8,749 Updated Dec 1, 2024

cyrta / awesome-speech-enhancement

A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.

66 15 Updated Sep 9, 2019

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 12,781 878 Updated Oct 3, 2024

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 9,102 1,413 Updated Dec 20, 2024

Anwarvic / Speaker-Recognition

This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1

Python 110 32 Updated May 22, 2019

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,547 310 Updated Jan 4, 2024

Lhx94As / JSTSP_w2v_for_LID

Source code for: Efficient Self-supervised Learning Representations for Spoken Language Identification

Python 4 Updated Sep 13, 2022

ldkong1205 / TranSVAE

[NeurIPS 2023] Unsupervised Video Domain Adaptation for Action Recognition: A Disentanglement Perspective

Jupyter Notebook 122 11 Updated Oct 25, 2023

zjc6666 / Accent-Recognition

Python 5 2 Updated Nov 23, 2021

wq2012 / SpeakerRecognitionFromScratch

Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家

Python 43 14 Updated May 7, 2024

azl397985856 / leetcode

LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解，记录自己的leetcode解题之路。)

JavaScript 54,869 9,472 Updated Dec 10, 2024

Lhx94As / PHO-LID

PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification

Python 19 2 Updated Aug 24, 2023

espnet / espnet

End-to-End Speech Processing Toolkit

Python 8,619 2,198 Updated Dec 23, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hexin Liu Lhx94As

Achievements

Achievements

Block or report Lhx94As

Stars

kaistmm / FlowAVSE

Tonyyouyou / Mutual-Information-Analysis

wutaiqiang / MoSLoRA

yangdongchao / AcademiCodec

YUCHEN005 / NASE

Tonyyouyou / Mamba-in-Speech

descriptinc / descript-audio-codec

chq1155 / A-Survey-on-Generative-Diffusion-Model

FrenchKrab / IS2023-powerset-diarization

mli / paper-reading

datawhalechina / easy-rl

rshaojimmy / MultiModal-DeepFake

BUTSpeechFIT / VBx

AIGC-Audio / AudioGPT

HuangZiliAndy / SSL_for_multitalker

hyperion-ml / hyperion

k2-fsa / fast_rnnt

openai / whisper

cyrta / awesome-speech-enhancement

openai / tiktoken

speechbrain / speechbrain

Anwarvic / Speaker-Recognition

facebookresearch / encodec

Lhx94As / JSTSP_w2v_for_LID

ldkong1205 / TranSVAE

zjc6666 / Accent-Recognition

wq2012 / SpeakerRecognitionFromScratch

azl397985856 / leetcode

Lhx94As / PHO-LID

espnet / espnet