Stars
Command-line program to download videos from YouTube.com and other video sites
Clone a voice in 5 seconds to generate arbitrary speech in real-time
A Gradio web UI for Large Language Models with support for multiple inference backends.
GUI for a Vocal Remover that uses Deep Neural Networks.
WebUI extension for ControlNet
Experience macOS just like before
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Stable Diffusion built-in to Blender
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
Pydoll is a library for automating chromium-based browsers without a WebDriver, offering realistic interactions.
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
Generate and download e-books from online sources.
set prompt to divided region
A family of diffusion models for text-to-audio generation.
Rewrite of blender-wiggle with new features and physics
Extension for AUTOMATIC1111 to add custom backend API for Krita Plugin & more
This Blender addon is aimed to help you integrate Cascadeur into your workflow.
YOLOv4, YOLOv4-tiny, YOLOv3, YOLOv3-tiny Implemented in Tensorflow 2.0, Android. Convert YOLO v4 .weights tensorflow, tensorrt and tflite
Extension for Automatic1111 and ComfyUI to automatically create masks for Background/Hair/Body/Face/Clothes in Img2Img
This blender Python Script maps an OpenPose Facial Capture to a blender facial Rig
Modern UI for Deep Packet Inspection bypass utils (Windows 10/11)