Alan-2018

💭

I may be slow to respond.

Mr.C Alan-2018

6 followers · 21 following

Stars

ScottishFold007 / TTSAudioNormalizer

TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loudness normalization operations.

Python 78 14 Updated Dec 20, 2024

comfyanonymous / ComfyUI

The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.

Python 61,871 6,610 Updated Dec 31, 2024

ibttf / interview-coder

An open-source invisible desktop application to help you pass your technical interviews.

TypeScript 718 96 Updated Jan 1, 2025

lipku / LiveTalking

Real-time interactive streaming digital human

Python 4,224 616 Updated Dec 29, 2024

wan-h / awesome-digital-human-live2d

Awesome Digital Human

TypeScript 1,050 109 Updated Dec 16, 2024

weihaox / awesome-digital-human

Digital Human Resource Collection: 2D/3D/4D human modeling, avatar generation & animation, clothed people digitalization, virtual try-on, and others.

1,545 144 Updated Oct 14, 2024

ccfddl / ccf-deadlines

⏰ Collaboratively track deadlines of conferences recommended by CCF (website, Python CLI, WeChat applet) / If you find it useful, please star this project, thanks~

Vue 6,643 455 Updated Dec 30, 2024

WeThinkIn / Interview-for-Algorithm-Engineer

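[Three Years of Interviews, Five Years of Mock Exams] An interview handbook for AI algorithm engineers, covering interview and written-test experience and practical knowledge from across the AI industry: AIGC, traditional deep learning, autonomous driving, machine learning, computer vision, natural language processing, SLAM, embodied intelligence, the metaverse, AGI, and more.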

948 148 Updated Dec 30, 2024

ZHO-ZHO-ZHO / ComfyUI-Workflows-ZHO

My ComfyUI workflows collection

5,535 525 Updated Dec 20, 2024

ictnlp / LLaMA-Omni

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,695 185 Updated Nov 14, 2024

BeyondDimension / SteamTools

🛠 "Watt Toolkit" is an open-source, cross-platform, multi-purpose Steam toolbox.

C# 20,762 1,355 Updated Dec 19, 2024

Giftia / ChatDACS

A simple chatbot framework that supports integration with multiple platforms and includes a full-featured web console.

JavaScript 75 19 Updated Dec 31, 2024

imuncle / live2d

A collection and showcase of Live2D models that can be used directly on static websites.

JavaScript 730 174 Updated Jun 17, 2022

datawhalechina / self-llm

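"A Guide to Consuming Open-Source Large Models": a tutorial tailored for Chinese users on quickly fine-tuning (full-parameter/LoRA) and deploying domestic and international open-source large language models (LLMs) and multimodal large models (MLLMs) in a Linux environment.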

Jupyter Notebook 10,599 1,225 Updated Dec 31, 2024

guansss / pixi-live2d-display

A PixiJS plugin to display Live2D models of any kind.

TypeScript 996 142 Updated Aug 20, 2024

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 11,027 699 Updated Dec 17, 2024

mlfoundations / dclm

DataComp for Language Models

HTML 1,190 108 Updated Dec 11, 2024

aliutkus / speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Python 923 156 Updated Jul 5, 2023

stefantaubert / mean-opinion-score

Python library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings according to Ribeiro et al. (2011).

Python 23 1 Updated Aug 11, 2023

ggarber / rtcscore

JS Library to estimate the Mean Opinion Score (MOS) for Real Time audio & video communications

JavaScript 35 4 Updated Apr 4, 2024

yeyupiaoling / VoiceprintRecognition-Pytorch

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. More models may be supported in the future. At the …

Python 858 128 Updated Nov 19, 2024

mymagicpower / AIAS

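A free, commercially usable, one-stop Java AI solution that lightens workloads and accelerates product development. It is a collection of over 100 projects, including a Java PyTorch training engine, AI SDKs, and web applications. | Artificial Intelligence Accelerator Kit. It provides: a project collection consisting of over 1…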

Java 796 273 Updated Dec 31, 2024

X-LANCE / StoryTTS

[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations

HTML 137 4 Updated Apr 27, 2024

mangiucugna / json_repair

A python module to repair invalid JSON, commonly used to parse the output of LLMs

Python 1,328 69 Updated Dec 31, 2024

baidubce / bce-qianfan-sdk

Provides best practices for LMOps, as well as elegant and convenient access to the features of the Qianfan MaaS Platform.

Jupyter Notebook 345 53 Updated Dec 24, 2024

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,334 1,865 Updated Dec 31, 2024

NVIDIA / DeepLearningExamples

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 13,721 3,254 Updated Aug 12, 2024

xiaomingnio / kantts

TTS application based on ModelScope KAN-TTS

Python 43 6 Updated Apr 11, 2024

innnky / ar-vits

text to speech using autoregressive transformer and VITS

Python 234 15 Updated Apr 3, 2024

dusty-nv / jetson-voice

ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT

Python 193 49 Updated Feb 9, 2024
