Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Python 396 42 Updated Sep 13, 2024

A curated list of my favourite music DSP and audio programming resources

2,659 88 Updated Mar 19, 2025

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

686 55 Updated Feb 25, 2025

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

1,877 237 Updated Jun 6, 2024

Reference-aware automatic speech evaluation toolkit

Python 145 12 Updated Dec 5, 2024

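Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments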

Python 21,206 6,143 Updated Mar 23, 2025

Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.

TypeScript 31,494 5,819 Updated Mar 19, 2025

JiOu-LLM: an odd/even number classification model based on Llama 2

Python 4 1 Updated Mar 11, 2024

Implementation of Differentiable Digital Signal Processing (DDSP) in PyTorch

C 465 56 Updated Oct 28, 2023

DDSP: Differentiable Digital Signal Processing

Python 2,993 351 Updated Sep 23, 2024

A PyTorch autoencoder implementation explained line by line, trained on the MNIST dataset, with the code kept simple.

Python 18 1 Updated Feb 9, 2024

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 4,167 234 Updated Dec 12, 2024

Train the next generation of TTS systems.

Python 165 17 Updated Sep 13, 2024

Distortion/saturation plugin

C++ 41 1 Updated Jul 12, 2024

JS Inflator is a copy of Sonnox Inflator.

C++ 270 28 Updated Jan 11, 2025

Conditioning and feature fusion methods such as FiLM, Conditional Layer Norm and AdaIN.

Python 7 2 Updated Feb 10, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 48,598 5,195 Updated Jan 22, 2025

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,650 118 Updated Jul 5, 2024

A curated list of JUCE modules, templates, plugins, oh my!

Ruby 938 45 Updated Mar 27, 2025

Collection of tutorials & resources for the C++ library JUCE

Makefile 115 11 Updated Jun 7, 2020

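Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center of the IDEA Research Institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.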

Python 4,095 379 Updated Aug 13, 2024

PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.

Python 297 47 Updated Aug 25, 2021

Finetune MobileSAM with Less Than 4GB RAM!

Jupyter Notebook 22 5 Updated Nov 12, 2023

A Hugging Face mirror site.

275 36 Updated Mar 18, 2024

An Efficient Lexical Analyzer for Chinese

Python 2,053 335 Updated Jan 31, 2022

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 88,329 23,711 Updated Mar 28, 2025

Vector (and Scalar) Quantization, in PyTorch

Python 3,072 246 Updated Mar 24, 2025

Extract the voice and corresponding text

C# 79 9 Updated Jan 20, 2025

Fast Segment Anything

Python 7,788 724 Updated Jul 30, 2024

Implementation of Nougat Neural Optical Understanding for Academic Documents

Python 9,359 604 Updated Feb 21, 2025