nicktao9

NickTao nicktao9

0 followers · 18 following

Starred repositories

ImperialCollegeLondon / sap-voicebox

Speech Processing Toolbox for MATLAB

MATLAB 241 69 Updated Mar 5, 2025

vbelz / Speech-enhancement

Deep learning for audio denoising

Python 687 130 Updated Oct 15, 2023

lovemefan / SenseVoice.cpp

Port of Funasr's Sense-voice model in C/C++

C 273 28 Updated Mar 4, 2025

Huanshere / VideoLingo

Netflix-level subtitle cutting, translation, alignment, and even dubbing - a one-click, fully automated AI video subtitle team

Python 11,681 1,130 Updated Feb 16, 2025

xiaoli1368 / Microphone-sound-source-localization

🎤 Microphone sound source localization by SRP-PHAT and other numerical methods.

MATLAB 206 33 Updated Sep 9, 2019

morriswmz / doatools.py

A simple library for theoretical research on direction-of-arrival (DOA) estimation in array signal processing.

Python 169 47 Updated Jan 28, 2021

morriswmz / doa-tools

A set of MATLAB functions for direction-of-arrival (DOA) estimation in array signal processing.

MATLAB 319 90 Updated Nov 7, 2018

WenzheLiu-Speech / sound-source-localization-algorithm_DOA_estimation

Some traditional algorithms used for sound source localization (DOA estimation) of speech signals

MATLAB 409 84 Updated Jun 30, 2021

brofield / simpleini

Cross-platform C++ library providing a simple API to read and write INI-style configuration files

C++ 1,166 328 Updated Dec 9, 2024

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,351 176 Updated Feb 14, 2025

alibabasglab / MossFormer2

This is the audio sample repository for speech separation model "MossFormer2".

Python 120 9 Updated Nov 28, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,556 1,148 Updated Mar 7, 2025

opendilab / CleanS2S

High-quality and streaming Speech-to-Speech interactive agent in a single file. A streaming, full-duplex voice interaction agent prototype implemented in just one file!

Python 349 33 Updated Mar 4, 2025

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,838 194 Updated Nov 14, 2024

eembc / coremark

CoreMark® is an industry-standard benchmark that measures the performance of central processing units (CPU) and embedded microcontrollers (MCU).

C 1,009 345 Updated Aug 6, 2024

Enny1991 / beamformers

Easy to use Beamformers for multi-channel speech separation/enhancement

Python 192 49 Updated Jan 26, 2021

AkojimaSLP / Beamforming-for-speech-enhancement

Simple delay-and-sum, MVDR and CGMM-MVDR

Python 252 74 Updated Jan 19, 2019

funcwj / CGMM-MVDR

Implementation of the CGMM-MVDR beamforming (for the Python version, please refer to https://github.com/funcwj/setk)

Python 146 55 Updated Aug 12, 2020

werman / noise-suppression-for-voice

Noise suppression plugin based on Xiph's RNNoise

C++ 5,372 244 Updated May 18, 2024

usefulsensors / moonshine

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 2,616 136 Updated Feb 26, 2025

MrSupW / ICMC-ASR_Baseline

The baseline system for the ICASSP2024 ICMC-ASR Challenge.

Python 47 9 Updated Dec 6, 2023

FunAudioLLM / SenseVoice

Multilingual Voice Understanding Model

Python 4,777 433 Updated Jan 8, 2025

dujingning / inicpp

The INI header-only library for Modern C++ supports reading and writing, even writing comments. It is cross-platform and can be used on multiple operating systems. - MIT license.

C++ 50 7 Updated Feb 8, 2025

Guovin / iptv-api

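📺 IPTV live TV source update project (✨ instant-playback experience 🚀): supports IPv4/IPv6; supports custom channels; supports local sources, multicast sources, hotel sources, subscription sources, and keyword search; updates automatically twice a day, with results usable in players such as TVBox; runs via workflow, Docker (amd64/arm64/arm v7), command line, or GUI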

Python 14,009 3,831 Updated Mar 6, 2025

PortAudio / portaudio

PortAudio is a cross-platform, open-source C language library for real-time audio input and output.

C 1,634 325 Updated Feb 8, 2025

edisonwong520 / jarvis

Jarvis: an intelligent voice-controlled assistant for Mac OS. A Chinese-language Jarvis voice assistant (a desktop Siri)

Python 71 20 Updated Dec 8, 2021

artsy / eigen

The Art World in Your Pocket or Your Trendy Tech Company's Tote, Artsy's mobile app.

TypeScript 3,635 586 Updated Mar 6, 2025

mcinglis / c-style

My favorite C programming practices.

2,048 98 Updated Oct 1, 2020

PeterH0323 / Streamer-Sales

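Streamer-Sales (Top Seller): a sales-streamer LLM 🛒🎁, a large model that presents products based on their given features from the angle of stimulating users' desire to buy. 🚀⭐ Includes a detailed data generation pipeline ❗ 📦 It also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries real-time information over the web 🌐, ASR (speech-to-text) 🎙️, a front end built on the Vue ecosystem 🍍, FastAPI …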

Python 3,004 461 Updated Nov 11, 2024

liguodongiot / llm-action

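This project shares the technical principles of large models along with practical experience (large-model engineering and real-world deployment of large-model applications)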

HTML 14,974 1,733 Updated Mar 2, 2025
