Skip to content
View GangdanW's full-sized avatar

Block or report GangdanW

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Cog wrapper for microsoft/Florence-2-base

Python 7 1 Updated Jun 25, 2024

ComfyUI Node

Python 309 16 Updated Oct 22, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,147 469 Updated Nov 6, 2024

Recommended based on comfyui node pictures:Joy_caption + MiniCPMv2_6-prompt-generator + florence2

Python 492 28 Updated Jan 20, 2025

[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Python 1,031 113 Updated Jan 25, 2025

Hard-fork of the Ryujinx project

C# 2,335 558 Updated Jan 20, 2025

You can call Using Sapiens to get seg,normal,pose,depth,mask

Python 141 5 Updated Dec 5, 2024

High-resolution models for human tasks.

Python 4,780 279 Updated Nov 18, 2024

A fork of Git containing Windows-specific patches.

C 8,511 2,600 Updated Jan 30, 2025

EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 3,487 396 Updated Dec 10, 2024

You can using EchoMimic in ComfyUI

Python 517 50 Updated Jan 16, 2025

EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 2,466 286 Updated Jan 27, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,325 5,601 Updated Jan 30, 2025
Python 227 33 Updated May 22, 2024

A selection of nodes for Stable Diffusion ComfyUI

Python 458 48 Updated Dec 18, 2024

An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)

Python 2,689 269 Updated Jan 24, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 65,246 6,971 Updated Jan 30, 2025

Downloads videos and playlists from YouTube

C# 10,093 1,363 Updated Jan 25, 2025

ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, th…

Python 8,211 1,078 Updated Jan 30, 2025

A webui for different audio related Neural Networks

Python 1,112 103 Updated Aug 16, 2024

vits2 backbone with multilingual-bert

Python 8,207 1,163 Updated Jan 27, 2025

A simple, high-quality voice conversion tool focused on ease of use and performance.

Python 2,036 326 Updated Jan 30, 2025

This is the official repository for M2UGen

Jupyter Notebook 467 37 Updated Jan 2, 2025

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,084 860 Updated Jul 6, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,391 2,219 Updated Jan 15, 2025

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 36,780 4,328 Updated Aug 19, 2024