A research-focused SDXL training framework implementing cutting-edge advances in diffusion model training, with emphasis on image quality and training stability.
The framework provides a template for easily implementing new training methods:
```python
# src/training/trainers/methods/example_method.py
from typing import Any, Dict
from torch import Tensor

from src.training.trainers.sdxl_trainer import SDXLTrainer  # assumed path; adjust to this repo's layout


class ExampleMethodTrainer(SDXLTrainer):
    name = "example_method"  # Your method's name

    def compute_loss(self, batch: Dict[str, Tensor]) -> Dict[str, Any]:
        """Implement your method's loss computation here."""
        raise NotImplementedError()
```
To add a new method:
- Copy the `example_method.py` template.
- Implement the `compute_loss()` method with your training logic.
- Register the method in `config.yaml`:

```yaml
training:
  method: "your_method_name"
```
The template handles all of the boilerplate, including:
- Memory optimizations
- Mixed precision training
- Gradient accumulation
- Progress tracking
- Metric logging
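For a concrete (if toy) example, a plain epsilon-prediction MSE objective could look like the sketch below. This is an illustration only; `self.add_noise` and `self.predict` are hypothetical helpers standing in for whatever the base trainer actually exposes:

```python
import torch
import torch.nn.functional as F
from typing import Any, Dict
from torch import Tensor

class MSEExampleTrainer(SDXLTrainer):
    """Toy epsilon-prediction objective; the helper methods used here are hypothetical."""
    name = "mse_example"

    def compute_loss(self, batch: Dict[str, Tensor]) -> Dict[str, Any]:
        latents = batch["latents"]                    # VAE latents prepared by the data pipeline
        noise = torch.randn_like(latents)
        t = torch.randint(0, 1000, (latents.shape[0],), device=latents.device)
        noisy = self.add_noise(latents, noise, t)     # hypothetical helper on the base trainer
        pred = self.predict(noisy, t, batch)          # hypothetical helper wrapping the UNet forward
        loss = F.mse_loss(pred.float(), noise.float())
        return {"loss": loss, "timesteps": t}
```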
Flow Matching with Logit-Normal Sampling [4]
A training method that removes the dependence on a fixed noise schedule by learning a velocity field directly along optimal transport paths:
```yaml
# Configure Flow Matching training
training:
  method: "flow_matching"
  batch_size: 4
  learning_rate: 1.0e-6
```
Key benefits:
- 30% faster convergence via optimal transport paths
- Direct velocity field learning reduces instability
- No noise schedule dependencies
- Logit-normal time sampling for better coverage
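The core of the objective fits in a few lines. The following is a simplified sketch of flow matching with logit-normal time sampling, not the repo's implementation; `model`, `cond`, `logit_mean`, and `logit_std` are placeholder names:

```python
import torch
import torch.nn.functional as F

def flow_matching_loss(model, x0, cond, logit_mean=0.0, logit_std=1.0):
    """Optimal-transport flow matching with logit-normal time sampling (illustrative)."""
    b = x0.shape[0]
    # Logit-normal time sampling: t = sigmoid(N(mean, std)), concentrating t away from 0 and 1.
    t = torch.sigmoid(torch.randn(b, device=x0.device) * logit_std + logit_mean)
    t_ = t.view(b, 1, 1, 1)

    noise = torch.randn_like(x0)
    # Straight-line (optimal transport) interpolation between data (t=0) and noise (t=1).
    x_t = (1.0 - t_) * x0 + t_ * noise
    target_velocity = noise - x0            # constant velocity along the straight path

    v_pred = model(x_t, t, cond)            # placeholder for the velocity-predicting UNet call
    return F.mse_loss(v_pred.float(), target_velocity.float())
```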
NovelAI V3 UNet Architecture [7]
State-of-the-art model improvements:
```yaml
# Enable NovelAI V3 features
training:
  prediction_type: "v_prediction"
  zero_terminal_snr: true
  sigma_max: 20000.0
```
Improvements:
- Zero Terminal SNR training
- Infinite noise approximation (σ_max ≈ 20000)
- Better artifact handling
- Enhanced detail preservation
- V-prediction parameterization
- Stable high-frequency gradients
- Reduced color shifting
- Dynamic Karras schedule
- Adaptive noise spacing
- Improved texture quality
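Two of these pieces can be stated precisely in code. The sketch below gives the standard zero-terminal-SNR beta rescaling and the v-prediction target, written from the general ZTSNR and v-parameterization recipes rather than copied from this repository:

```python
import torch

def rescale_zero_terminal_snr(betas: torch.Tensor) -> torch.Tensor:
    """Rescale a beta schedule so that SNR at the final timestep is exactly zero."""
    alphas_bar_sqrt = torch.cumprod(1.0 - betas, dim=0).sqrt()
    a0, aT = alphas_bar_sqrt[0].clone(), alphas_bar_sqrt[-1].clone()
    # Shift the last value to 0 while keeping the first value unchanged.
    alphas_bar_sqrt = (alphas_bar_sqrt - aT) * a0 / (a0 - aT)
    alphas_bar = alphas_bar_sqrt ** 2
    alphas = torch.cat([alphas_bar[0:1], alphas_bar[1:] / alphas_bar[:-1]])
    return 1.0 - alphas

def v_prediction_target(x0: torch.Tensor, noise: torch.Tensor,
                        alphas_bar: torch.Tensor, t: torch.Tensor) -> torch.Tensor:
    """v-parameterization target: v = sqrt(abar_t) * eps - sqrt(1 - abar_t) * x0."""
    a = alphas_bar[t].sqrt().view(-1, 1, 1, 1)
    s = (1.0 - alphas_bar[t]).sqrt().view(-1, 1, 1, 1)
    return a * noise - s * x0
```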
Training progress is monitored using Weights & Biases (wandb):
- Real-time loss tracking
- Generated sample visualization
- Hyperparameter logging
- Custom metric tracking
- Experiment comparison
Enable monitoring in config.yaml:
```yaml
training:
  use_wandb: true
```
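Behind the `use_wandb` flag, this amounts to standard wandb calls. A minimal, self-contained sketch (project name, metric keys, and the dummy values are placeholders, not the framework's actual logging code):

```python
import numpy as np
import wandb

# Illustrative only: project name, config keys, and metrics are placeholders.
run = wandb.init(project="sdxl-training-improvements",
                 config={"method": "flow_matching", "learning_rate": 1.0e-6})

for step in range(3):                                   # stand-in for the real training loop
    loss = 1.0 / (step + 1)                             # dummy metric
    wandb.log({"train/loss": loss, "train/lr": 1.0e-6}, step=step)

# Generated samples can be logged as images for visual inspection.
wandb.log({"samples": wandb.Image(np.zeros((64, 64, 3), dtype=np.uint8))}, step=3)

run.finish()
```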
Image quality improvements:
- Enhanced Detail Preservation
  - Fine-grained texture generation
  - Improved handling of complex patterns
  - Better preservation of small objects and features
- Color and Lighting
  - More accurate color reproduction
  - Enhanced dynamic range in highlights and shadows
  - Better handling of complex lighting conditions
- Composition and Structure
  - Improved spatial coherence
  - Better handling of perspective and depth
  - More consistent object proportions
| Component | Minimum |
|---|---|
| Python | 3.8+ |
| CUDA | 11.7+ |
| VRAM | 24GB+ |
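A quick way to sanity-check a machine against these requirements (a standalone snippet for illustration, not a script shipped with the repository):

```python
import platform
import torch

# Illustrative environment check against the table above.
print("Python:", platform.python_version())
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    props = torch.cuda.get_device_properties(0)
    print(f"GPU: {props.name}, VRAM: {props.total_memory / 1024**3:.1f} GiB")
    print("CUDA runtime:", torch.version.cuda)
```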
```bash
# Clone repository
git clone https://github.com/DataCTE/SDXL-Training-Improvements.git
cd SDXL-Training-Improvements

# Install in development mode with all extras
pip install -e ".[dev,docs]"

# Verify installation
python -c "import src; print(src.__version__)"
```
The training framework is configured through a YAML file. Key configuration sections:
```yaml
# Model configuration
model:
  pretrained_model_name: "stabilityai/stable-diffusion-xl-base-1.0"
  num_timesteps: 1000
  sigma_min: 0.002
  sigma_max: 80.0

# Training parameters
training:
  batch_size: 4
  learning_rate: 4.0e-7
  method: "ddpm"  # or "flow_matching"
  zero_terminal_snr: true

# Dataset configuration
data:
  train_data_dir:
    - "path/to/dataset1"
    - "path/to/dataset2"
```
See config.yaml for full configuration options.
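The file can be read with any YAML parser; a minimal sketch using PyYAML, where the key names follow the example above while the repo's own loader may add validation and defaults:

```python
import yaml  # PyYAML

# Minimal illustration of reading the training config.
with open("config.yaml", "r") as f:
    cfg = yaml.safe_load(f)

assert cfg["training"]["method"] in {"ddpm", "flow_matching"}
print("model:", cfg["model"]["pretrained_model_name"])
print("batch size:", cfg["training"]["batch_size"],
      "| learning rate:", cfg["training"]["learning_rate"])
```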
Basic training:

```bash
# Train with default config
python src/main.py --config config.yaml

# Train with custom config
python src/main.py --config my_config.yaml
```

Distributed training:

```bash
torchrun --nproc_per_node=2 src/main.py --config config.yaml
```
- nyanko7, "nyaflow-xl-alpha: SDXL finetuning with Flow Matching", https://huggingface.co/nyanko7/nyaflow-xl-alpha, 2024
- Ossa et al., "Improvements to SDXL in NovelAI Diffusion V3", arXiv:2409.15997, 2024
MIT License - see LICENSE file.