Stable Diffusion
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).
Stable Diffusion web UI
Transparent Image Layer Diffusion using Latent Transparency
🔥🔥Official Repository for Multi-Human-Parsing (MHP)🔥🔥
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Open-Sora: Democratizing Efficient Video Production for All
Bird's eye/Top Down view generation and mapping with deep learning.
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)
🆙 Upscayl - #1 Free and Open Source AI Image Upscaler for Linux, MacOS and Windows.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Demonstration of Wasp and Supabase working together, using Llama 3 and SDXL to generate greeting cards!
LAVIS - A One-stop Library for Language-Vision Intelligence
An archived repository of the Google Blocks source code: a VR creation app originally released for the HTC Vive and Oculus Rift
AuraSR: GAN-based Super-Resolution for real-world
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".