Stars
This repo contains a PyTorch implementation with DeepWave for the manuscript FWIGAN: Full-waveform inversion via a physics-informed generative adversarial network
Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation"
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers in this list investigate the learning behavior, generalizat…
Easily train a good VC model with voice data <= 10 mins!
制作懂人情世故的大语言模型 | 涵盖提示词工程、RAG、Agent、LLM微调教程
An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowe…
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
Restoration for TEMPEST images using deep-learning
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Official PyTorch implementation for "Unifying Bayesian Flow Networks and Diffusion Models through Stochastic Differential Equations"
Large World Model -- Modeling Text and Video with Millions Context
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
A collection of awesome text-to-image generation studies.
Diffusion Model-Based Image Editing: A Survey (arXiv)
A collection of resources on controllable generation with text-to-image diffusion models.
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
Papers and resources on Controllable Generation using Diffusion Models, including ControlNet, DreamBooth, IP-Adapter.
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
[TPAMI 2023] Multimodal Image Synthesis and Editing: The Generative AI Era
GGOT: A Gaussian graphical optimal transport method to detecting disease tipping points
To be the world's best PyTorch project template.
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation