nicktao9

Starred repositories

Deep learning for audio denoising

Python 676 128 Updated Oct 15, 2023

Port of Funasr's Sense-voice model in C/C++

C 225 16 Updated Jan 5, 2025

Netflix-level subtitle cutting, translation, alignment, and even dubbing: a one-click, fully automated AI video subtitle team

Python 9,348 919 Updated Jan 5, 2025

🎤 Microphone sound source localization by SRP-PHAT and other numerical methods.

MATLAB 202 33 Updated Sep 9, 2019

A simple library for theoretical research on direction-of-arrival (DOA) estimation in array signal processing.

Python 167 48 Updated Jan 28, 2021

A set of MATLAB functions for direction-of-arrival (DOA) estimation in array signal processing.

MATLAB 315 91 Updated Nov 7, 2018

Some traditional algorithms used for sound source localization and DOA estimation of speech signals

MATLAB 400 84 Updated Jun 30, 2021

Cross-platform C++ library providing a simple API to read and write INI-style configuration files

C++ 1,153 322 Updated Dec 9, 2024

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 2,066 145 Updated Jan 22, 2025

This is the audio sample repository for speech separation model "MossFormer2".

Python 120 9 Updated Nov 28, 2024

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 9,838 954 Updated Jan 15, 2025

High-quality, streaming, full-duplex speech-to-speech interactive agent prototype implemented in a single file.

Python 320 31 Updated Jan 10, 2025

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,761 186 Updated Nov 14, 2024

CoreMark® is an industry-standard benchmark that measures the performance of central processing units (CPU) and embedded microcontrollers (MCU).

C 993 338 Updated Aug 6, 2024

Easy to use Beamformers for multi-channel speech separation/enhancement

Python 190 49 Updated Jan 26, 2021

Simple delay-and-sum, MVDR, and CGMM-MVDR beamforming

Python 247 74 Updated Jan 19, 2019

Implementation of the CGMM-MVDR beamforming (for python version please refer to https://github.com/funcwj/setk)

Python 146 55 Updated Aug 12, 2020

Noise suppression plugin based on Xiph's RNNoise

C++ 5,250 240 Updated May 18, 2024

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 2,504 129 Updated Jan 14, 2025

The baseline system for the ICASSP2024 ICMC-ASR Challenge.

Python 47 9 Updated Dec 6, 2023

Multilingual Voice Understanding Model

Python 4,147 368 Updated Jan 8, 2025

A header-only INI library for modern C++ that supports reading and writing, including writing comments. It is cross-platform and can be used on multiple operating systems. MIT license.

C++ 46 7 Updated Dec 23, 2024

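📺 IPTV live TV source update project ("✨ instant-playback experience 🚀"): supports IPv4/IPv6; supports custom channels; supports local sources, multicast sources, hotel sources, subscription sources, and keyword search; updates automatically twice a day, with results usable in players such as TVBox; can be run as a workflow, in Docker (amd64/arm64/arm v7), from the command line, or through a GUI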

Python 11,975 3,061 Updated Jan 22, 2025

PortAudio is a cross-platform, open-source C language library for real-time audio input and output.

C 1,590 318 Updated Nov 25, 2024

Jarvis: An intelligent voice-controlled assistant for Mac OS. A Chinese-language Jarvis voice assistant (Siri for the desktop)

Python 70 20 Updated Dec 8, 2021

The Art World in Your Pocket or Your Trendy Tech Company's Tote, Artsy's mobile app.

TypeScript 3,612 582 Updated Jan 22, 2025

My favorite C programming practices.

2,028 98 Updated Oct 1, 2020

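Streamer-Sales (Top Seller): a sales-livestreamer LLM 🛒🎁 that presents products based on their given features, from the angle of stimulating the user's desire to buy. 🚀⭐ Includes a detailed data-generation pipeline❗ 📦 Also integrates LMDeploy accelerated inference 🚀, RAG retrieval-augmented generation 📚, TTS text-to-speech 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, ASR speech-to-text 🎙️, a front end built with the Vue ecosystem 🍍, FastAPI …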

Python 2,802 429 Updated Nov 11, 2024

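This project aims to share the technical principles behind large models and hands-on experience with them (large-model engineering and deploying large-model applications)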

HTML 13,213 1,485 Updated Jan 15, 2025

Production-first, NN-based on-device signal processing toolkit.

64 3 Updated May 30, 2023