v0.9.0-alpha - multi-dataset edition #249
bghira
announced in
Announcements
Changes
There's more info about these in OPTIONS.md and TUTORIAL.md.
New behaviour
With "scan_for_errors": true in a dataset config entry, the dataset will be read entirely at startup and any bad images will be removed if delete_problematic_images: true is set. It will also remove any outdated cache entries.
"disabled": true can be set in the dataset config entry.
Removed arguments
multidatabackend.json
--data_backend is now --data_backend_config and is a path to a dataset config; see multidatabackend.json.example for help converting your existing configurations over.
New arguments
--data_backend_config (DATALOADER_CONFIG in sdxl-env.sh)
--override_dataset_config
--vae_cache_behaviour
.json cache files from your dataset and switch the data backend config to use square images rather than aspect-crops. This will result in an inconsistent data cache, which can be corrected by setting scan_for_errors to true in your multidatabackend.json configuration file. When this scan runs, it relies on the setting of --vae_cache_behaviour to determine how to resolve the inconsistency: recreate (the default) will remove the offending cache entry so that it can be recreated, and sync will update the bucket metadata to reflect the reality of the real training sample. Recommended value: recreate
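To make the difference between recreate and sync concrete, here is a minimal Python sketch of the resolution step. It is not the project's actual implementation: the function name, the in-memory dictionaries standing in for the VAE cache and the bucket metadata, and the use of image sizes to detect a mismatch are all assumptions made for illustration. Only the scan_for_errors key and the two behaviour values come from the notes above.

```python
from typing import Dict, List, Tuple

Size = Tuple[int, int]  # (width, height); a hypothetical way to detect a mismatch


def resolve_cache_inconsistencies(
    dataset_entry: dict,
    vae_cache: Dict[str, Size],
    bucket_metadata: Dict[str, Size],
    vae_cache_behaviour: str = "recreate",
) -> List[str]:
    """Return the sample paths whose cache entry disagreed with its bucket.

    All three containers are simplified stand-ins, not the real structures.
    """
    if vae_cache_behaviour not in ("recreate", "sync"):
        raise ValueError(f"unknown behaviour: {vae_cache_behaviour!r}")
    # The scan only runs when the dataset config entry opts in.
    if not dataset_entry.get("scan_for_errors", False):
        return []

    resolved = []
    # Copy the items so entries can be deleted while iterating.
    for path, cached_size in list(vae_cache.items()):
        if bucket_metadata.get(path) == cached_size:
            continue  # consistent, nothing to do
        if vae_cache_behaviour == "recreate":
            # Drop the offending cache entry so it can be rebuilt later.
            del vae_cache[path]
        else:
            # "sync": make the bucket metadata match the cached sample.
            bucket_metadata[path] = cached_size
        resolved.append(path)
    return resolved


entry = {"scan_for_errors": True}
cache = {"a.png": (1024, 1024), "b.png": (1216, 832)}
buckets = {"a.png": (1024, 1024), "b.png": (1024, 1024)}
print(resolve_cache_inconsistencies(entry, cache, buckets))  # ['b.png']
```

With recreate, the mismatched entry leaves the cache and the bucket metadata is untouched; with sync, the cache is kept and the metadata is rewritten to match it.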
Full Changelog: v0.8.2...v0.9.0-alpha
This discussion was created from the release v0.9.0-alpha - multi-dataset edition.