Stars
Open and extensible continuous delivery solution for Kubernetes. Powered by GitOps Toolkit.
ComfyUI custom nodes for Ovis2 multimodal model integration
The nodes detached from [ComfyUI Layer Style](https://github.com/chflame163/ComfyUI_LayerStyle) are mainly those with complex requirements for dependency packages.
SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.
This custom_node for ComfyUI adds one-click "Virtual VRAM" for any GGUF UNet and CLIP loader, managing the offload of layers to DRAM or VRAM to maximize the latent space of your card. Also includes…
For unloading a model or all models, using the memory management that is already present in ComfyUI. Copied from https://github.com/willblaschko/ComfyUI-Unload-Models but without the unnecessary ex…
State-of-the-art 2D and 3D Face Analysis Project
Official repository of In-Context LoRA for Diffusion Transformers
Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Training released! Surpasses GPT-4o in ID persistence! Official ComfyUI workflow release! Only 4GB VRAM is enou…
Research code of ICCV 2021 paper "Mesh Graphormer"
HandFixer: a one-click hand repair workflow for ComfyUI.
Use InstantIR to upscale images in ComfyUI. InstantIR: Blind Image Restoration with Instant Generative Reference.
A set of nodes for ComfyUI that can composite layers and masks to achieve Photoshop-like functionality.
Run ComfyUI workflows on multiple local GPUs/networked machines.
xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism
ComfyUI's ControlNet Auxiliary Preprocessors
A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language.
Fast and extensible multi-platform HTTP/1-2-3 web server with automatic HTTPS
Convert Any OpenAPI V3 API to MCP Server
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
Like Manus, Computer Use Agent (CUA) and Omniparser, we are computer-using agents. AI-driven local automation assistant that uses natural language to make computers work by themselves.
A simple screen parsing tool towards pure vision based GUI agent