Shengqiang Li Shengqiang-Li

Speech recognition/synthesis

23 followers · 41 following

Northwestern Polytechnical University
Suzhou

Stars

e1tts / e1tts.github.io

Python 6 Updated Sep 16, 2024

LqNoob / Neural-Codec-and-Speech-Language-Models

Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models

Python 82 2 Updated Dec 26, 2024

MU94W / TTS-Eval

JavaScript 18 13 Updated Aug 9, 2018

ScottishFold007 / TTSAudioNormalizer

TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loudness normalization operations.

Python 78 14 Updated Dec 20, 2024

LLMBook-zh / LLMBook-zh.github.io

"Large Language Models" (《大语言模型》), by Xin Zhao, Junyi Li, Kun Zhou, Tianyi Tang, and Ji-Rong Wen

2,417 159 Updated Apr 22, 2024

InternLM / InternLM-XComposer

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,676 159 Updated Dec 26, 2024

Shubhamsaboo / awesome-llm-apps

Collection of awesome LLM apps with RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 9,650 960 Updated Dec 26, 2024

Lightning-AI / litgpt

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 10,981 1,093 Updated Dec 26, 2024

saiteja-talluri / Speech2Face

Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL

Python 173 35 Updated Mar 24, 2023

facebookresearch / VisualVoice

Audio-Visual Speech Separation with Cross-Modal Consistency

Python 225 36 Updated Jul 25, 2023

Stability-AI / stable-codec

A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.

189 2 Updated Dec 3, 2024

ga642381 / speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

794 48 Updated Dec 21, 2024

jishengpeng / WavChat

A Survey of Spoken Dialogue Models (60 pages)

222 13 Updated Nov 28, 2024

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 1,867 129 Updated Dec 25, 2024