Starred repositories

Speech Processing Toolbox for MATLAB

MATLAB 241 69 Updated Mar 5, 2025

Deep learning for audio denoising

Python 687 130 Updated Oct 15, 2023

Port of Funasr's Sense-voice model in C/C++

C 273 28 Updated Mar 4, 2025

Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team

Python 11,681 1,130 Updated Feb 16, 2025

🎤 Microphone sound source localization by SRP-PHAT and other numerical methods.

MATLAB 206 33 Updated Sep 9, 2019

A simple library for theoretical research on direction-of-arrival (DOA) estimation in array signal processing.

Python 169 47 Updated Jan 28, 2021

A set of MATLAB functions for direction-of-arrival (DOA) estimation in array signal processing.

MATLAB 319 90 Updated Nov 7, 2018

Some traditional algorithms used for DOA estimation in speech-signal sound source localization

MATLAB 409 84 Updated Jun 30, 2021

Cross-platform C++ library providing a simple API to read and write INI-style configuration files

C++ 1,166 328 Updated Dec 9, 2024

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,351 176 Updated Feb 14, 2025

This is the audio sample repository for speech separation model "MossFormer2".

Python 120 9 Updated Nov 28, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 11,556 1,148 Updated Mar 7, 2025

High-quality, streaming, full-duplex speech-to-speech interactive agent prototype implemented in a single file.

Python 349 33 Updated Mar 4, 2025

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,838 194 Updated Nov 14, 2024

CoreMark® is an industry-standard benchmark that measures the performance of central processing units (CPU) and embedded microcontrollers (MCU).

C 1,009 345 Updated Aug 6, 2024

Easy to use Beamformers for multi-channel speech separation/enhancement

Python 192 49 Updated Jan 26, 2021

Simple delay-and-sum, MVDR, and CGMM-MVDR

Python 252 74 Updated Jan 19, 2019

Implementation of CGMM-MVDR beamforming (for the Python version, please refer to https://github.com/funcwj/setk)

Python 146 55 Updated Aug 12, 2020

Noise suppression plugin based on Xiph's RNNoise

C++ 5,372 244 Updated May 18, 2024

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 2,616 136 Updated Feb 26, 2025

The baseline system for the ICASSP2024 ICMC-ASR Challenge.

Python 47 9 Updated Dec 6, 2023

Multilingual Voice Understanding Model

Python 4,777 433 Updated Jan 8, 2025

The INI header-only library for Modern C++ supports reading and writing, even writing comments. It is cross-platform and can be used on multiple operating systems. - MIT license.

C++ 50 7 Updated Feb 8, 2025

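📺 IPTV live TV source update project ("✨ instant-playback experience 🚀"): supports IPv4/IPv6; supports custom channels; supports local sources, multicast sources, hotel sources, subscription sources, and keyword search; updates automatically twice a day, with results usable in player software such as TVBox; can be run via workflow, Docker (amd64/arm64/arm v7), command line, or GUI.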

Python 14,009 3,831 Updated Mar 6, 2025

PortAudio is a cross-platform, open-source C language library for real-time audio input and output.

C 1,634 325 Updated Feb 8, 2025

Jarvis: an intelligent voice-controlled assistant for macOS. A Chinese-language Jarvis voice assistant (Siri for the desktop).

Python 71 20 Updated Dec 8, 2021

The Art World in Your Pocket or Your Trendy Tech Company's Tote, Artsy's mobile app.

TypeScript 3,635 586 Updated Mar 6, 2025

My favorite C programming practices.

2,048 98 Updated Oct 1, 2020

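Streamer-Sales (Top Seller): a sales-livestreamer LLM 🛒🎁 that presents products based on their given features, from the angle of stimulating users' desire to buy. 🚀⭐ Includes a detailed data generation pipeline❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, ASR (speech-to-text) 🎙️, a Vue-based front end 🍍, FastAPI …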

Python 3,004 461 Updated Nov 11, 2024

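This project aims to share the technical principles of large models along with hands-on experience (large-model engineering and deploying large-model applications)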

HTML 14,974 1,733 Updated Mar 2, 2025