Skip to content
View SEMLLYCAT's full-sized avatar

Block or report SEMLLYCAT

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 5 Updated Feb 13, 2025

Python library for extracting chords from multiple sound file formats

Python 163 27 Updated Feb 9, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 18,900 1,502 Updated Feb 23, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,036 125 Updated Mar 2, 2025

code based for rectified flow

Python 160 6 Updated Feb 21, 2025

AEC3 Extracted From WebRTC

C++ 168 79 Updated Feb 24, 2022

Implementation of the proposed minGRU in Pytorch

Python 281 22 Updated Feb 13, 2025

Official inference framework for 1-bit LLMs

C++ 12,777 898 Updated Feb 18, 2025

This is the official implementation of the LiSenNet

Python 56 5 Updated Nov 15, 2024

offical code for Dense-TSNet

11 Updated Sep 17, 2024

Target Speaker Extraction Toolkit

Python 146 16 Updated Feb 7, 2025

Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation, clipping, equalization (EQ) distortion, packet loss, codec…

Python 56 4 Updated Jul 29, 2024

Port of Funasr's Sense-voice model in C/C++

C 269 27 Updated Feb 25, 2025

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

654 38 Updated Aug 3, 2024

该代码与B站上的视频 https://www.bilibili.com/video/BV18L41197Uz/?spm_id_from=333.788&vd_source=eefa4b6e337f16d87d87c2c357db8ca7 相关联。

Python 66 3 Updated Oct 7, 2023

This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamics"

Python 45 4 Updated Oct 4, 2024

ESC-50: Dataset for Environmental Sound Classification

Python 1,483 293 Updated Mar 20, 2024

On-device noise suppression powered by deep learning

Python 67 4 Updated Feb 20, 2025

模型压缩的小白入门教程

246 33 Updated Nov 19, 2024

The Billboard Melodic Music Dataset

44 3 Updated Jan 27, 2025
C++ 9 2 Updated Jul 17, 2024

Keep track of good articles on speech processing, mainly on speech enhancement, include speech denoise, speech dereverberation and aec、agc, etc.

43 5 Updated Jul 17, 2024

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]

Python 113 12 Updated Dec 11, 2024

Official implementation of "Separate Anything You Describe"

Python 1,693 120 Updated Nov 26, 2024

Real-time microphone noise suppression on Linux.

Go 9,519 226 Updated Jan 13, 2025

This is the CoNNear human auditory periphery model that simulates cochlear, IHC and AN processing across the human hearing range.

Python 4 1 Updated Dec 1, 2023

SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios

Python 221 22 Updated Jan 22, 2025

A curated list of neural network pruning resources.

2,413 331 Updated Apr 4, 2024
Python 84 11 Updated Dec 18, 2024

On-device AI across mobile, embedded and edge for PyTorch

C++ 2,558 462 Updated Mar 4, 2025
Next