Liujingxiu23

Follow

Liujingxiu23

Follow

25 followers · 41 following

Achievements

Achievements

Stars

feizc / FluxMusic

Text-to-Music Generation with Rectified Flow Transformers

Python 1,538 116 Updated Sep 6, 2024

innnky / MagVITS

VITS with phoneme-level prosody modeling based on MaskGIT

Python 74 7 Updated Aug 31, 2024

zhenye234 / xcodec

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Python 89 3 Updated Oct 1, 2024

lucidrains / transfusion-pytorch

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 614 23 Updated Oct 7, 2024

jishengpeng / WavTokenizer

SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling

Python 686 39 Updated Sep 21, 2024

yangdongchao / SimpleSpeech

The open source code for SimpleSpeech series

Python 90 6 Updated Aug 19, 2024

keonlee9420 / evaluate-zero-shot-tts

Evaluation Protocol for Large-Scale Zero-Shot TTS Literature

Python 49 7 Updated Sep 26, 2024

OpenT2S / LlamaVoice

LlamaVoice is a llama-based large voice generation model, providing inference and training ability.

Python 214 11 Updated Aug 26, 2024

theodorblackbird / lina-speech

lina-speech : linear attention based text-to-speech

Jupyter Notebook 116 9 Updated Jun 3, 2024

ictnlp / NAST-S2x

A fast speech-to-any translation model that supports simultaneous decoding and offers 28× speedup.

Python 60 4 Updated Aug 12, 2024

liutaocode / TTS-arxiv-daily

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python 232 19 Updated Oct 8, 2024

sizhelee / Diff-BGM

official code for CVPR'24 paper Diff-BGM

Python 40 3 Updated Mar 28, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

12,046 769 Updated Oct 7, 2024

huggingface / parler-tts

Inference and training library for high-quality TTS models.

Python 4,319 436 Updated Sep 23, 2024

ex3ndr / supervoice-gpt

GPT-style network for phonemization with durations of text

Jupyter Notebook 62 9 Updated Mar 21, 2024

davidmartinrius / speech-dataset-generator

🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.

Python 197 20 Updated Jun 10, 2024

enragedginger / cover-song-recognition-much

Python 9 2 Updated Jun 20, 2019

Audio-AGI / AudioSep

Official implementation of "Separate Anything You Describe"

Python 1,587 115 Updated Mar 31, 2024

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 7,862 1,114 Updated Oct 7, 2024

Vaibhavs10 / open-tts-tracker

1,083 69 Updated Jun 21, 2024

wenet-e2e / speech-synthesis-paper

List of speech synthesis papers.

993 120 Updated Jul 24, 2023

yxlu-0102 / MP-SENet

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Python 293 44 Updated Sep 13, 2024

Liujingxiu23 / MP-SENet

Forked from yxlu-0102/MP-SENet

MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra

Python 1 Updated Dec 24, 2023

TomJwYu / WenetSpeechSpeakerCluster

55 2 Updated Jul 17, 2023

Yuan-ManX / ai-audio-datasets

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

476 33 Updated Sep 6, 2024

Liujingxiu23 / ai-audio-datasets-list

Forked from Yuan-ManX/ai-audio-datasets

This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio …

1 Updated Nov 8, 2023

jackaduma / awesome_LLMs_interview_notes

LLMs interview notes and answers:该仓库主要记录大模型（LLMs）算法工程师相关的面试题和参考答案

1,149 255 Updated Dec 14, 2023

BrasD99 / HeyGenClone

A simple and open-source analogue of the HeyGen system

Python 881 178 Updated Aug 1, 2024

innnky / ar-vits

text to speech using autoregressive transformer and VITS

Python 224 15 Updated Apr 3, 2024

b04901014 / MQTTS

Python 253 35 Updated May 15, 2023