Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,393 1,872 Updated Jan 17, 2025

THUDM / CogVideo

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,329 962 Updated Jan 16, 2025

Plachtaa / VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/

Python 7,754 770 Updated Feb 11, 2024

oarriaga / face_classification

Real-time face detection and emotion/gender classification using the FER2013 and IMDB datasets with a Keras CNN model and OpenCV.

Python 5,624 1,594 Updated Mar 8, 2024

pkuliyi2015 / multidiffusion-upscaler-for-automatic1111

Tiled Diffusion and VAE optimization, licensed under CC BY-NC-SA 4.0

Python 4,818 341 Updated Aug 7, 2024

continue-revolution / sd-webui-segment-anything

Segment Anything for Stable Diffusion WebUI

Python 3,442 208 Updated Apr 30, 2024

kijai / ComfyUI-CogVideoXWrapper

Python 1,273 80 Updated Jan 14, 2025

ddlBoJack / emotion2vec

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 716 55 Updated Dec 23, 2024

KevinWang676 / ChatGLM2-Voice-Cloning

Chat with any character you like, with immersive voice and video conversation: ChatGLM2 + SadTalker + Voice Cloning

Python 596 92 Updated Aug 11, 2023

IronSpiderMan / MachineLearningPractice

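Hands-on machine learning case studies covering machine learning, deep learning, and related areas. Each case is roughly a hundred lines of code.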

Python 194 34 Updated Jan 14, 2024

jinxycandotailwhip / opencv-singleEyeDetection-graduationDesign

Returns the 3D coordinates of a flexible robot using the monocular distance-measurement principle; implemented with OpenCV on a Raspberry Pi.

Python 33 5 Updated May 5, 2021

CHB-learner / EmotionDetection_RealTime-master

Python 14 2 Updated Jul 10, 2023

Jacob JacobNg1

Lists (3)

Drones

In-house development

Superstitious internet access

Stars

XingangPan / DragGAN

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

LlamaFamily / Llama-Chinese

PaddlePaddle / PaddleSpeech