Hexin Liu Lhx94As

💭

I may be slow to respond.

Postdoc at the Speech Lab, NTU. Language identification and multilingual/code-switching ASR.

35 followers · 20 following

Nanyang Technological University
Singapore
https://scholar.google.com/citations?user=iAT_5-kAAAAJ&hl=en

Stars

130 results for source starred repositories

yeyupiaoling / Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…

C 932 153 Updated Dec 24, 2024

kaistmm / FlowAVSE

Python 8 2 Updated Jul 15, 2024

Tonyyouyou / Mutual-Information-Analysis

Python 3 Updated Sep 14, 2024

wutaiqiang / MoSLoRA

Python 91 9 Updated Jul 6, 2024

yangdongchao / AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Python 607 80 Updated Dec 27, 2023

YUCHEN005 / NASE

Code for paper "Noise-aware Speech Enhancement using Diffusion Probabilistic Model"

Python 84 2 Updated Jun 10, 2024

Tonyyouyou / Mamba-in-Speech

Python 27 2 Updated Jul 1, 2024

descriptinc / descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Python 1,248 117 Updated Jul 11, 2024

chq1155 / A-Survey-on-Generative-Diffusion-Model

925 60 Updated Oct 18, 2023

FrenchKrab / IS2023-powerset-diarization

Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.

Jupyter Notebook 75 4 Updated Oct 18, 2023

mli / paper-reading

Paragraph-by-paragraph close readings of classic and new deep learning papers

27,789 2,481 Updated Nov 17, 2024

datawhalechina / easy-rl

Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄); read online at https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 9,901 1,908 Updated Nov 8, 2024

rshaojimmy / MultiModal-DeepFake

[TPAMI 2024 & CVPR 2023] PyTorch code for DGM4: Detecting and Grounding Multi-Modal Media Manipulation and beyond

Python 391 30 Updated Apr 23, 2024

BUTSpeechFIT / VBx

Variational Bayes HMM over x-vectors diarization

Python 259 57 Updated Jan 15, 2024

AIGC-Audio / AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,087 868 Updated Jul 6, 2024

HuangZiliAndy / SSL_for_multitalker

ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS

Shell 27 1 Updated Mar 16, 2023

hyperion-ml / hyperion

Python toolkit for speech processing

Python 68 21 Updated Jan 9, 2025

k2-fsa / fast_rnnt

A torch implementation of a recursion which turns out to be useful for RNN-T.

Python 140 22 Updated Aug 25, 2023

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 74,169 8,862 Updated Jan 4, 2025

cyrta / awesome-speech-enhancement

A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.

66 15 Updated Sep 9, 2019

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 12,976 902 Updated Oct 3, 2024

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 9,175 1,416 Updated Jan 11, 2025

Anwarvic / Speaker-Recognition

This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1

Python 110 33 Updated May 22, 2019

facebookresearch / encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Python 3,560 311 Updated Jan 4, 2024

Lhx94As / JSTSP_w2v_for_LID

Source code for: Efficient Self-supervised Learning Representations for Spoken Language Identification

Python 4 Updated Sep 13, 2022

ldkong1205 / TranSVAE

[NeurIPS 2023] Unsupervised Video Domain Adaptation for Action Recognition: A Disentanglement Perspective

Jupyter Notebook 122 11 Updated Oct 25, 2023

zjc6666 / Accent-Recognition

Python 5 2 Updated Nov 23, 2021

wq2012 / SpeakerRecognitionFromScratch

Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家

Python 43 14 Updated May 7, 2024

azl397985856 / leetcode

LeetCode Solutions: A Record of My Problem Solving Journey.

JavaScript 54,924 9,466 Updated Dec 10, 2024

Lhx94As / PHO-LID

PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification

Python 21 2 Updated Aug 24, 2023
