Stars
The beautifulprompt extension performs automatic prompt engineering for Stable Diffusion in a browser UI.
JoyCaption is an image captioning Visual Language Model (VLM) being built from the ground up as a free, open, and uncensored model for the community to use in training Diffusion models.
picobyte / stable-diffusion-webui-wd14-tagger
Forked from toriato/stable-diffusion-webui-wd14-tagger
Labeling extension for Automatic1111's Web UI
Linux virtual machines, with a focus on running containers
Wan: Open and Advanced Large-Scale Video Generative Models
[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
Virtual whiteboard for sketching hand-drawn like diagrams
✨✨Latest Papers and Datasets on Mobile and PC GUI Agent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents
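AI as Workspace - A better AI (LLM) client. Full-featured, lightweight. Support multiple workspaces, plugin system, cross-platform, local first + real-time cloud sync, Artifacts, MCP | A better AI client
A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.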
wlxj1992 / StyleSelectorXL
Forked from ahgsql/StyleSelectorXL
This repository contains an Automatic1111 extension that allows users to select and apply different styles to their inputs using SDXL 1.0.
Custom prompt styler node for SDXL in ComfyUI
This repository contains an Automatic1111 extension that allows users to select and apply different styles to their inputs using SDXL 1.0.
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
BiRefNet for AUTOMATIC1111 Stable Diffusion WebUI