
TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loudness normalization operations.

Python 78 14 Updated Dec 20, 2024

The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.

Python 61,871 6,610 Updated Dec 31, 2024

An open-source invisible desktop application to help you pass your technical interviews.

TypeScript 718 96 Updated Jan 1, 2025

Real-time interactive streaming digital human

Python 4,224 616 Updated Dec 29, 2024

Awesome Digital Human

TypeScript 1,050 109 Updated Dec 16, 2024

Digital Human Resource Collection: 2D/3D/4D human modeling, avatar generation & animation, clothed people digitalization, virtual try-on, and others.

1,545 144 Updated Oct 14, 2024

⏰ Collaboratively track deadlines of conferences recommended by CCF (website, Python CLI, WeChat applet) / If you find it useful, please star this project, thanks~

Vue 6,643 455 Updated Dec 30, 2024

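[Three Years of Interviews, Five Years of Mock Tests] An interview handbook for AI algorithm engineers, covering interview and written-test experience and practical knowledge across the AI industry: AIGC, traditional deep learning, autonomous driving, machine learning, computer vision, natural language processing, SLAM, embodied intelligence, the metaverse, AGI, and more.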

948 148 Updated Dec 30, 2024

My ComfyUI workflows collection

5,535 525 Updated Dec 20, 2024

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,695 185 Updated Nov 14, 2024

🛠 Watt Toolkit is an open-source, cross-platform, multi-purpose Steam toolbox.

C# 20,762 1,355 Updated Dec 19, 2024

A simple chatbot framework that supports integration with multiple platforms and includes a full-featured web console.

JavaScript 75 19 Updated Dec 31, 2024

A collection and showcase of Live2D models that can be used directly on static websites.

JavaScript 730 174 Updated Jun 17, 2022

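"A Guide to Using Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying open-source large language models (LLMs) and multimodal large models (MLLMs), from China and abroad, in a Linux environment.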

Jupyter Notebook 10,599 1,225 Updated Dec 31, 2024

A PixiJS plugin to display Live2D models of any kind.

TypeScript 996 142 Updated Aug 20, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,027 699 Updated Dec 17, 2024

DataComp for Language Models

HTML 1,190 108 Updated Dec 11, 2024

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 923 156 Updated Jul 5, 2023

Python library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings according to Ribeiro et al. (2011).

Python 23 1 Updated Aug 11, 2023

JS Library to estimate the Mean Opinion Score (MOS) for Real Time audio & video communications

JavaScript 35 4 Updated Apr 4, 2024

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, and CAM++. More models may be supported in the future. At the …

Python 858 128 Updated Nov 19, 2024

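A free, commercially usable, one-stop Java AI solution that reduces workload and speeds up product development. Project categories include a Java PyTorch training engine, AI SDKs, web applications, and more, forming a collection of over 100 projects. | Artificial Intelligence Accelerator Kit. It provides: a project collection consisting of over 1…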

Java 796 273 Updated Dec 31, 2024

[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations

HTML 137 4 Updated Apr 27, 2024

A python module to repair invalid JSON, commonly used to parse the output of LLMs

Python 1,328 69 Updated Dec 31, 2024

Provides best practices for LMOps, as well as elegant and convenient access to the features of the Qianfan MaaS Platform.

Jupyter Notebook 345 53 Updated Dec 24, 2024

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,334 1,865 Updated Dec 31, 2024

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 13,721 3,254 Updated Aug 12, 2024

TTS application based on ModelScope KAN-TTS

Python 43 6 Updated Apr 11, 2024

Text-to-speech using an autoregressive transformer and VITS

Python 234 15 Updated Apr 3, 2024

ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT

Python 193 49 Updated Feb 9, 2024