11721206

9 followers · 176 following

Lists (2)

SSP

8 repositories

TTS

6 repositories

Stars

nii-yamagishilab / ZMM-TTS

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

C 137 9 Updated Mar 6, 2024

KdaiP / StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Python 372 42 Updated Sep 13, 2024

lmfit / lmfit-py

Non-Linear Least Squares Minimization, with flexible Parameter settings, based on scipy.optimize, and with many additional classes and methods for curve fitting.

Python 1,090 279 Updated Dec 15, 2024

Choddeok / EmoSpherepp

The official implementation of EmoSphere++

Python 62 5 Updated Nov 6, 2024

AaronZ345 / TCSinger

PyTorch implementation of TCSinger (EMNLP 2024): Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control

Python 284 49 Updated Dec 7, 2024

hustcxl / SP_Lib

Signal processing method and algorithm library

240 60 Updated Aug 25, 2020

TongTong313 / rectified-flow

Flow Matching (Rectified Flow) hand-built from scratch

Python 222 11 Updated Dec 7, 2024

swagger-coder / ASDA

This is an official PyTorch implementation of ASDA (accepted by ACMMM 2024).

Python 17 Updated Oct 22, 2024

THUDM / GLM-4-Voice

GLM-4-Voice | End-to-end Chinese-English spoken dialogue model

Python 2,510 201 Updated Dec 5, 2024

3b1b / manim

Animation engine for explanatory math videos

Python 73,261 6,398 Updated Dec 28, 2024

SWivid / F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 8,542 1,098 Updated Dec 29, 2024

shuzijun / leetcode-editor

Do LeetCode exercises in the IDE; supports leetcode.com and leetcode-cn.com to meet the basic needs of doing exercises. Theoretically supports: IntelliJ IDEA PhpStorm WebStorm PyCharm RubyMine AppCode CL…

Java 3,769 405 Updated Aug 12, 2024

ivcylc / OpenMusic

OpenMusic: SOTA Text-to-music (TTM) Generation

Python 507 50 Updated Dec 21, 2024

FireRedTeam / FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

Python 510 36 Updated Oct 17, 2024

benfred / py-spy

Sampling profiler for Python programs

Rust 13,070 438 Updated Dec 17, 2024

yangdongchao / SimpleSpeech

The open source code for SimpleSpeech series

Python 120 6 Updated Oct 8, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 8,999 873 Updated Dec 31, 2024

OpenT2S / LlamaVoice

LlamaVoice is a llama-based large voice generation model, providing inference and training ability.

Python 225 12 Updated Aug 26, 2024

nothings / stb

stb single-file public domain libraries for C/C++

C 27,403 7,733 Updated Nov 9, 2024

andreasfertig / cppinsights

C++ Insights - See your source code with the eyes of a compiler

C++ 4,151 244 Updated Oct 21, 2024

breakfastquay / rubberband

Official mirror of Rubber Band Library, an audio time-stretching and pitch-shifting library.

C++ 591 96 Updated Oct 25, 2024

enginBozkurt / Error-State-Extended-Kalman-Filter

Vehicle State Estimation using Error-State Extended Kalman Filter

Python 240 54 Updated Jun 30, 2023

XianruiWang / MCSSFDAF

Multichannel State Space Frequency-Domain Adaptive Filtering (MCSSFDAF)

Python 4 2 Updated May 25, 2024

CoatiSoftware / Sourcetrail

Sourcetrail - free and open-source interactive source explorer

C++ 15,008 1,421 Updated Dec 13, 2021

scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction

Unofficial PyTorch reproduction of the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (arXiv:2401.01498)

Python 59 4 Updated Apr 4, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 21,240 2,189 Updated Nov 11, 2024

yxlllc / ReFlow-VAE-SVC

Python 123 18 Updated Sep 25, 2024

fgnt / pb_bss

Collection of EM algorithms for blind source separation of audio signals

Python 276 60 Updated Aug 1, 2024

inboxpraveen / ASR-Accuracy-Tool

🎙️📝 A powerful Flask-based web application that leverages the latest Hugging Face ASR models to provide real-time speech-to-text (STT) transcripts with an intuitive user interface for easy correcti…

Python 6 4 Updated Oct 17, 2023

rfetick / Kalman

Implement Kalman filter for your Arduino projects

C++ 137 15 Updated Aug 15, 2024
