Shengqiang Li Shengqiang-Li

💭

I may be slow to respond.

Speech recognition/synthesis

23 followers · 41 following

Northwestern Polytechnical University
Suzhou

Lists (12)

ASR
ASV
Codec (14 repositories)
Corpus
Inference
LLM (38 repositories)
Others
RL
SE
TTS (54 repositories)
VC
VPN

Stars

223 stars written in Python

xtekky / gpt4free

The official gpt4free repository | various collection of powerful language models

Python · 62,883 stars · 13,458 forks · Updated Dec 21, 2024

THUDM / ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Python · 40,899 stars · 5,235 forks · Updated Jun 27, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python · 37,398 stars · 4,246 forks · Updated Dec 19, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python · 36,383 stars · 4,474 forks · Updated Aug 16, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python · 32,685 stars · 4,975 forks · Updated Dec 28, 2024

lllyasviel / ControlNet

Let us control diffusion models!

Python · 31,045 stars · 2,786 forks · Updated Feb 25, 2024

facebookresearch / fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python · 30,733 stars · 6,434 forks · Updated Oct 18, 2024

myshell-ai / OpenVoice

Instant voice cloning by MIT and MyShell.

Python · 30,203 stars · 2,986 forks · Updated Dec 24, 2024

svc-develop-team / so-vits-svc

SoftVC VITS Singing Voice Conversion

Python · 26,196 stars · 4,875 forks · Updated Nov 11, 2023

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Python · 25,495 stars · 3,713 forks · Updated Nov 24, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python · 22,891 stars · 2,252 forks · Updated Dec 27, 2024

microsoft / unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python · 20,480 stars · 2,575 forks · Updated Dec 15, 2024

unslothai / unsloth

Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory

Python · 19,712 stars · 1,388 forks · Updated Dec 27, 2024

mlc-ai / mlc-llm

Universal LLM Deployment Engine with ML Compilation

Python · 19,490 stars · 1,602 forks · Updated Dec 19, 2024

Anjok07 / ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Python · 18,769 stars · 1,395 forks · Updated Dec 9, 2024

fishaudio / fish-speech

SOTA Open Source TTS

Python · 17,774 stars · 1,328 forks · Updated Dec 25, 2024

w-okada / voice-changer

Realtime Voice Changer

Python · 16,796 stars · 1,827 forks · Updated Nov 14, 2024

ddbourgin / numpy-ml

Machine learning, in numpy

Python · 15,829 stars · 3,768 forks · Updated Oct 29, 2023

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python · 13,152 stars · 1,103 forks · Updated Dec 23, 2024

openai / tiktoken

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python · 12,811 stars · 880 forks · Updated Oct 3, 2024

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python · 12,555 stars · 2,575 forks · Updated Dec 28, 2024

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python · 11,316 stars · 1,866 forks · Updated Dec 27, 2024

instantX-research / InstantID

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python · 11,262 stars · 823 forks · Updated Jul 18, 2024

Rudrabha / Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

Python · 11,047 stars · 2,339 forks · Updated Nov 26, 2024

Lightning-AI / litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python · 10,999 stars · 1,095 forks · Updated Dec 26, 2024

kornia / kornia

Geometric Computer Vision Library for Spatial AI

Python · 10,101 stars · 978 forks · Updated Dec 25, 2024

Shubhamsaboo / awesome-llm-apps

Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.

Python · 9,856 stars · 986 forks · Updated Dec 27, 2024

cumulo-autumn / StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python · 9,822 stars · 715 forks · Updated Dec 4, 2024

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python · 9,108 stars · 1,413 forks · Updated Dec 20, 2024

jiaaro / pydub

Manipulate audio with a simple and easy high level interface

Python · 9,050 stars · 1,055 forks · Updated Jul 25, 2024