Stars
A pipeline parallel training script for diffusion models.
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
Various AI scripts. Mostly Stable Diffusion stuff.
Standardized DataLoaders for 3D Computer Vision
Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, code-like database branching, and scale to zero.
An open-source framework for making universal native apps with React. Expo runs on Android, iOS, and the web.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Stable Diffusion implemented from scratch in PyTorch
An implementation of various color transfer algorithms.
ComfyUi inside of your Photoshop! you can install the plugin and enjoy free ai genration
A user-friendly plug-in that makes it easy to generate stable diffusion images inside Photoshop using either Automatic or ComfyUI as a backend.
CSGO: Content-Style Composition in Text-to-Image Generation 🔥
Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"
Officail Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"
Officail Implementation for "ReNoise: Real Image Inversion Through Iterative Noising"
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
InstantStyle-Plus: Style Transfer with Content-Preserving in Text-to-Image Generation 🔥
Easily train a good VC model with voice data <= 10 mins!
Simple Guidance Mechanisms for Discrete Diffusion Models