GitHub - stepover/sd-webui-controlnet: WebUI extension for ControlNet

sd-webui-controlnet

(WIP) WebUI extension for ControlNet and T2I-Adapter

This extension for AUTOMATIC1111's Stable Diffusion web UI lets the web UI add ControlNet to the original Stable Diffusion model when generating images. ControlNet is added on the fly, so no model merging is required.

ControlNet is a neural network structure to control diffusion models by adding extra conditions.

Thanks to and inspired by: kohya-ss/sd-webui-additional-networks

Limits

Dragging a large file onto the Web UI may freeze the entire page. It is better to use the file upload option instead.
Just like WebUI's hijack, we use interpolation to accept arbitrary size configurations (see scripts/cldm.py).

Install

Open the "Extensions" tab.
Open the "Install from URL" tab inside it.
Enter the URL of this repo in "URL for extension's git repository".
Press the "Install" button.
Reload/restart the Web UI.

If any UI issues occur, upgrade gradio: pip install gradio==3.16.2

Usage

Put the ControlNet models (.pt, .pth, .ckpt or .safetensors) inside the models/ControlNet folder.
Open the "txt2img" or "img2img" tab and write your prompts.
Press "Refresh models" and select the model you want to use. (If nothing appears, try reloading/restarting the WebUI.)
Upload your image and select a preprocessor. Done.

Both full models and trimmed models are currently supported. Use extract_controlnet.py to extract the ControlNet from an original .pth file.

ControlNet 1.1 is in the beta test.

Right now 12 models of ControlNet 1.1 are in beta testing (all models except inpaint and tile).

Download models from ControlNet 1.1: https://huggingface.co/lllyasviel/ControlNet-v1-1/tree/main

If you download models elsewhere, make sure that each YAML file has the same name as its model file, and manually rename the YAML files if they do not match. Otherwise, models may behave unexpectedly. Some third-party CivitAI and fp16 models have been renamed arbitrarily, so their YAML files no longer match. Some of these models (like shuffle) will then perform significantly worse than the official ones. Please download models from our Hugging Face page, where the YAML file names are correct.
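The name check described above can be automated. The following is a minimal sketch, not part of the extension: the function name `find_models_without_yaml` is hypothetical, and it assumes the YAML files sit in the same folder as the model files and use the `.yaml` extension.

```python
from pathlib import Path

# Model extensions listed in the Usage section.
MODEL_EXTS = {".pt", ".pth", ".ckpt", ".safetensors"}


def find_models_without_yaml(folder):
    """Return names of model files in `folder` that lack a same-named .yaml file.

    Hypothetical helper: assumes YAML files live next to the models.
    """
    folder = Path(folder)
    if not folder.is_dir():
        return []
    yaml_stems = {p.stem for p in folder.glob("*.yaml")}
    missing = []
    for path in sorted(folder.iterdir()):
        if path.suffix.lower() in MODEL_EXTS and path.stem not in yaml_stems:
            missing.append(path.name)
    return missing


if __name__ == "__main__":
    # Path is relative to the extension folder; adjust to your install.
    for name in find_models_without_yaml("models/ControlNet"):
        print("no matching yaml:", name)
```

Any file the script reports should have its YAML file renamed to match the model's file name.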

Documents of ControlNet 1.1: https://github.com/lllyasviel/ControlNet-v1-1-nightly

In 1.1, the previous "depth" is now called "depth_midas", the previous "normal" is called "normal_midas", and the previous "hed" is called "softedge_edge". Starting from 1.1, all line maps, edge maps, lineart maps and boundary maps have a black background and white lines.

Previous Models

Big Models: https://huggingface.co/lllyasviel/ControlNet/tree/main/models

Small Models: https://huggingface.co/webui/ControlNet-modules-safetensors

Tips

Regarding canvas height/width: they are designed for canvas generation. If you want to upload images directly, you can safely ignore them.

Examples

(Example images; columns: Source, Input, Output. Both rows use no preprocessor.)

T2I-Adapter Support

(From TencentARC/T2I-Adapter)

T2I-Adapter is a small network that can provide additional guidance for pre-trained text-to-image models.

To use T2I-Adapter models:

Download files from https://huggingface.co/TencentARC/T2I-Adapter
Copy the corresponding config file and rename it to match the model's name (see the list below).
It's better to use a slightly lower strength (t), such as 0.6-0.8, when generating images with the sketch model. (ref: ldm/models/diffusion/plms.py)

Adapter	Config
t2iadapter_canny_sd14v1.pth	sketch_adapter_v14.yaml
t2iadapter_sketch_sd14v1.pth	sketch_adapter_v14.yaml
t2iadapter_seg_sd14v1.pth	image_adapter_v14.yaml
t2iadapter_keypose_sd14v1.pth	image_adapter_v14.yaml
t2iadapter_openpose_sd14v1.pth	image_adapter_v14.yaml
t2iadapter_color_sd14v1.pth	t2iadapter_color_sd14v1.yaml
t2iadapter_style_sd14v1.pth	t2iadapter_style_sd14v1.yaml
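The copy-and-rename step can be scripted from the table above. This is a sketch under assumptions: `copy_adapter_configs` is a hypothetical helper, and `config_dir` is wherever you keep the downloaded config files (this README does not name a location).

```python
import shutil
from pathlib import Path

# Adapter -> config mapping, copied from the table above.
ADAPTER_CONFIGS = {
    "t2iadapter_canny_sd14v1.pth": "sketch_adapter_v14.yaml",
    "t2iadapter_sketch_sd14v1.pth": "sketch_adapter_v14.yaml",
    "t2iadapter_seg_sd14v1.pth": "image_adapter_v14.yaml",
    "t2iadapter_keypose_sd14v1.pth": "image_adapter_v14.yaml",
    "t2iadapter_openpose_sd14v1.pth": "image_adapter_v14.yaml",
    "t2iadapter_color_sd14v1.pth": "t2iadapter_color_sd14v1.yaml",
    "t2iadapter_style_sd14v1.pth": "t2iadapter_style_sd14v1.yaml",
}


def copy_adapter_configs(config_dir, model_dir):
    """Copy each present adapter's config next to it as <model name>.yaml.

    Hypothetical helper; returns the names of the config files now in place.
    """
    config_dir, model_dir = Path(config_dir), Path(model_dir)
    placed = []
    for model_name, config_name in ADAPTER_CONFIGS.items():
        model_path = model_dir / model_name
        source = config_dir / config_name
        if not model_path.is_file() or not source.is_file():
            continue  # skip adapters or configs that were not downloaded
        target = model_path.with_suffix(".yaml")
        if source.resolve() != target.resolve():
            shutil.copyfile(source, target)
        placed.append(target.name)
    return placed
```

For example, `copy_adapter_configs("configs", "models/ControlNet")` would leave `t2iadapter_canny_sd14v1.yaml` beside the canny adapter if both files were found; both folder names here are placeholders.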

Note:

This implementation is experimental; results may differ from the original repo.
Some adapters may have mapping deviations (see issue lllyasviel/ControlNet#255).

Adapter Examples

(Adapter example images; columns: Source, Input, Output. Four rows use no preprocessor; one row uses a clip, non-image input.)

Examples by catboxanon, no tweaking or cherrypicking. (Color Guidance)

Image	Disabled	Enabled

Minimum Requirements

(Windows) (NVIDIA: Ampere) 4 GB VRAM: with --xformers enabled and Low VRAM mode ticked in the UI, generation works up to 768x832.

Guess Mode (Non-Prompt Mode, Experimental)

Guess Mode is CFG Based ControlNet + Exponential decay in weighting.
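To make "exponential decay in weighting" concrete, here is a minimal sketch of one possible per-layer weighting schedule. The number of ControlNet outputs (13) and the decay base (0.825) are assumptions for illustration, and `guess_mode_scales` is a hypothetical name; check scripts/ in this repo for the values actually used.

```python
def guess_mode_scales(weight=1.0, num_outputs=13, decay=0.825):
    """Return one scale per ControlNet output, decaying exponentially.

    The last (deepest) output keeps the full weight; each earlier output
    is multiplied by `decay` once more. All defaults are assumptions.
    """
    return [weight * decay ** (num_outputs - 1 - i) for i in range(num_outputs)]


if __name__ == "__main__":
    for i, scale in enumerate(guess_mode_scales()):
        print(f"output {i:2d}: scale {scale:.4f}")
```

With this schedule the shallow outputs contribute very little and the deepest output contributes at full weight.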

See issue Mikubill#236 for more details.

Original introduction from controlnet:

The "guess mode" (or called non-prompt mode) will completely unleash all the power of the very powerful ControlNet encoder.

In this mode, you can just remove all prompts, and then the ControlNet encoder will recognize the content of the input control map, like depth map, edge map, scribbles, etc.

This mode is very suitable for comparing different methods to control stable diffusion because the non-prompted generating task is significantly more difficult than prompted task. In this mode, different methods' performance will be very salient.

For this mode, we recommend to use 50 steps and guidance scale between 3 and 5.

Multi-ControlNet / Joint Conditioning (Experimental)

This option allows multiple ControlNet inputs for a single generation. To enable it, change "Multi ControlNet: Max models amount (requires restart)" in the settings, then restart the WebUI for the change to take effect.

Guess Mode applies to all ControlNet units if it is enabled on any of them.

Source A	Source B	Output

Weight and Guidance Strength/Start/End

Weight is the strength of the ControlNet "influence". It is analogous to prompt attention/emphasis, e.g. (myprompt: 1.2). Technically, it is the factor by which the ControlNet outputs are multiplied before they are merged with the original SD UNet.
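As a rough illustration of that multiplication, the sketch below scales each ControlNet output and adds it to the matching UNet feature. Plain floats stand in for tensors, "merging" is assumed to be element-wise addition, and `merge_control` is a hypothetical name rather than a function of this extension.

```python
def merge_control(unet_features, control_outputs, weight=1.0):
    """Add weight-scaled ControlNet outputs to the matching UNet features.

    Sketch only: floats stand in for tensors, and addition is an assumed
    merge operation.
    """
    if len(unet_features) != len(control_outputs):
        raise ValueError("expected one ControlNet output per UNet feature")
    return [h + weight * c for h, c in zip(unet_features, control_outputs)]
```

A weight of 0 leaves the UNet features untouched, and a weight above 1 amplifies the control signal in the same way that (myprompt: 1.2) amplifies part of a prompt.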

Guidance Start/End is the range of total steps, as a percentage, over which the ControlNet is applied (guidance strength = guidance end). It is analogous to prompt editing/shifting, e.g. [myprompt::0.8] applies from the beginning until 80% of total steps.
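The step window can be sketched as a simple check on sampling progress. This is an illustration under assumptions: steps are taken to be 0-indexed, both bounds are treated as inclusive, and `control_active` is a hypothetical name.

```python
def control_active(step, total_steps, guidance_start=0.0, guidance_end=1.0):
    """Return True if ControlNet should apply at this sampling step.

    Assumes 0-indexed steps and inclusive bounds on the progress fraction.
    """
    progress = step / total_steps
    return guidance_start <= progress <= guidance_end


if __name__ == "__main__":
    # With 20 steps and guidance end 0.8, list the steps that get control.
    print([s for s in range(20) if control_active(s, 20, 0.0, 0.8)])
```

Raising guidance start delays the point at which ControlNet begins to steer the image, and lowering guidance end releases the image from control for the final steps.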

API/Script Access

This extension can accept txt2img or img2img tasks via the API or via calls from external extensions. Note that you may need to enable "Allow other scripts to control this extension" in settings for external calls.

To use the API: start the WebUI with the --api argument and go to http://webui-address/docs for documentation, or check out the examples.
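Because the available routes depend on the installed version, one cautious way to explore the API is to read the machine-readable schema behind the /docs page. The sketch below assumes the schema is served at /openapi.json (the FastAPI default, not stated in this README) and that ControlNet routes contain "controlnet" in their path; both function names are hypothetical.

```python
import json
from urllib.request import urlopen


def fetch_openapi(base_url):
    """Download the OpenAPI schema from a WebUI started with --api.

    Assumes the schema lives at <base_url>/openapi.json.
    """
    with urlopen(base_url.rstrip("/") + "/openapi.json", timeout=10) as resp:
        return json.load(resp)


def controlnet_routes(schema):
    """Return sorted (METHOD, path) pairs whose path mentions controlnet."""
    routes = []
    for path, methods in schema.get("paths", {}).items():
        if "controlnet" in path.lower():
            for method in methods:
                routes.append((method.upper(), path))
    return sorted(routes)
```

Calling `controlnet_routes(fetch_openapi("http://webui-address"))` lists the routes your install exposes; the request bodies for each route are described on the /docs page.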

To use external calls: check out the Wiki.

MacOS Support

Tested with pytorch nightly: Mikubill#143 (comment)

To use this extension with mps and normal pytorch, currently you may need to start WebUI with --no-half.

Example: Visual-ChatGPT (by API)

Quick start:

# Run WebUI in API mode
python launch.py --api --xformers

# Install/Upgrade transformers
pip install -U transformers

# Install deps
pip install langchain==0.0.101 openai

# Run example
python example/chatgpt.py
