Starred repositories
A latent text-to-image diffusion model
Examples and guides for using the Gemini API
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
A simple notebook demonstrating prompt-based music generation via Mubert API
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
Text To Video Synthesis Colab
Jupyter notebooks for Paperspace.
Symphony Generation with Permutation Invariant Language Model
A Real-ESRGAN equipped Colab notebook for CLIP Guided Diffusion
GPT3-based Multi-Instrumental MIDI Music AI Implementation
When Dall E was a baby trained on a bit of data
Exterior design using stable-diffusion 🏡