Lists (3)
Sort Name ascending (A-Z)
Stars
Official Code for DragGAN (SIGGRAPH 2023)
Easily train a good VC model with voice data <= 10 mins!
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.
Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0
Segment Anything for Stable Diffusion WebUI
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
Chat with any character you like: ChatGLM2+SadTalker+Voice Cloning | 和喜欢的角色沉浸式对话吧:ChatGLM2+声音克隆+视频对话
机器学习实战案例,涉及机器学习、深度学习等各个方向。每个案例代码量在百行左右。
利用单目测距原理实现柔性机器人三维坐标的返回,opencv+raspberrypi实现