Stars
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
a state-of-the-art-level open visual language model | 多模态预训练模型
[NeurIPS 2024🔥] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
A curated list of papers, code, and resources pertaining to object shadow generation.
A curated list of papers, code and resources pertaining to image composition/compositing or object insertion, which aims to generate realistic composite image.
freeCodeCamp.org's open-source codebase and curriculum. Learn to code for free.
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
A Survey on Image and Video Shadow Detection, Removal, and Generation in the Era of Deep Learning (Awesome & Benchmark)
A set of nodes for ComfyUI that can composite layer and mask to achieve Photoshop like functionality.
pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
A curated list of papers, code and resources pertaining to image harmonization.
[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Lumina-T2X is a unified framework for Text to Any Modality Generation
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Implementation of Z. Farbman, R. Fattal, D. Lischinski, and R. Szeliski, 'Edge-Preserving Decompositions for Multi-Scale Tone and Detail Manipulation' (2008)
A simple example for using `DDIMInverseScheduler` for inverting an input image to StableDiffusion's latent space
A simple extension of Controlnet for color condition
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
[CVPR 2020] The first large-scale public benchmark dataset for image harmonization. The code used in our paper "DoveNet: Deep Image Harmonization via Domain Verification", CVPR2020. Useful for imag…