Stars
Cog wrapper for microsoft/Florence-2-base
MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.
Recommended based on ComfyUI node pictures: Joy_caption + MiniCPMv2_6-prompt-generator + florence2
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
You can use Sapiens to get seg, normal, pose, depth, and mask outputs
High-resolution models for human tasks.
git-for-windows / git
Forked from git/git
A fork of Git containing Windows-specific patches.
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
A selection of nodes for Stable Diffusion ComfyUI
An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Downloads videos and playlists from YouTube
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…
A web UI for different audio-related neural networks
vits2 backbone with multilingual-bert
A simple, high-quality voice conversion tool focused on ease of use and performance.
This is the official repository for M2UGen
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
🔊 Text-Prompted Generative Audio Model