Lists (25)
Sort Name ascending (A-Z)
3D Gen
AR Image Generation
Yetodatasets
DeepFake Detection
DeepFake DetectionDiffusion Backbone
Diffusion Control
Diffusion Guidance
Diffusion Optimization
Diffusion Quality Enhancement
Face
Image Editting
image enhancement
Image-Text
Inpainting
Low Computation Less Data Train
mmsy
Optimization
others
setup
Super Resolution
Translate
Video - General Tasks
Video Generation
Video Style
Voice
Stars
Official repository for our work on micro-budget training of large-scale diffusion models.
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
🔥🔥🔥 This repository includes latest papers, projects and datasets on GenAI for Cel-Animation.
[ICPR 2024] Official repository of the paper "GenFormer - Generated Images are All You Need to Improve Robustness of Transformers on Small Datasets"
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Stay on top of trending topics on social media and the web with AI
Official Repo for Paper "AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea"
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought
A minimal and universal controller for FLUX.1.
Illumination Drawing Tools for Text-to-Image Diffusion Models
End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).
A generative world for general-purpose robotics & embodied AI learning.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
This is a PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Framework for Cross-Modality Evolution'
The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"
A suite of image and video neural tokenizers
Official code for DiFaReli
Nodes for image juxtaposition for Flux in ComfyUI
Taming FLUX for Image Inversion & Editing; OpenSora for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for Inversion and Editing.)
[ARXIV'24] StyleMaster: Stylize Your Video with Artistic Generation and Translation
The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting