  • Northwestern Polytechnical University
  • Suzhou
292 results for source starred repositories

(WIP) Long-form speech generation

Python 28 3 Updated Dec 12, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,317 191 Updated Aug 11, 2024

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

C++ 11,048 2,150 Updated Dec 13, 2024

✨✨Latest Advances on Multimodal Large Language Models

13,506 857 Updated Jan 13, 2025

An educational resource to help anyone learn deep reinforcement learning.

Python 10,360 2,258 Updated Aug 5, 2024
Python 6 Updated Sep 16, 2024

Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models

Python 94 5 Updated Jan 14, 2025
JavaScript 18 13 Updated Aug 9, 2018

TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loudness normalization operations.

Python 89 15 Updated Dec 20, 2024

"Large Language Models" (《大语言模型》), by Zhao Xin, Li Junyi, Zhou Kun, Tang Tianyi, and Wen Jirong

2,743 179 Updated Apr 22, 2024

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,706 163 Updated Dec 26, 2024

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 12,269 1,312 Updated Jan 12, 2025

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 11,131 1,115 Updated Jan 12, 2025

Implementation of the CVPR 2019 Paper - Speech2Face: Learning the Face Behind a Voice by MIT CSAIL

Python 173 35 Updated Mar 24, 2023

A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.

Python 269 16 Updated Jan 12, 2025

Awesome speech/audio LLMs, representation learning, and codec models

844 54 Updated Jan 13, 2025

A Survey of Spoken Dialogue Models (60 pages)

247 15 Updated Nov 28, 2024

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 1,989 140 Updated Jan 14, 2025

General Speech Restoration

Python 1,066 132 Updated May 31, 2024

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 6,662 807 Updated Jan 9, 2025

Manipulate audio with a simple and easy high level interface

Python 9,098 1,062 Updated Jul 25, 2024

Noise suppression using deep filtering

Python 2,658 246 Updated Oct 17, 2024

GUI for a Vocal Remover that uses Deep Neural Networks.

Python 19,005 1,409 Updated Dec 9, 2024

✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

Python 250 15 Updated Jan 2, 2025

Accessible large language models via k-bit quantization for PyTorch.

Python 6,507 647 Updated Jan 13, 2025

Convert Chinese characters to pinyin (pypinyin)

Python 4,953 618 Updated Jan 3, 2025

Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.

145 12 Updated Nov 10, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Python 3,726 357 Updated Jan 13, 2025

The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation

Python 246 30 Updated Jan 1, 2025

Pinyin data for Chinese characters

Python 1,261 218 Updated Jan 12, 2025