Official PyTorch implementation of "Redistributing the Precision and Content in 3D-LUT-based Inverse Tone-mapping for HDR/WCG Display" in CVMP 2023 (SIGGRAPH European Conference on Visual Media Prod…

C++ 43 3 Updated May 11, 2024

hyliu / piggyback-color

Improved Diffusion-based Image Colorization via Piggybacked Models

Jupyter Notebook 65 3 Updated May 31, 2023

ErwannMillon / Color-diffusion

A diffusion model to colorize black and white images

Python 758 25 Updated Aug 8, 2023

open-mmlab / StyleShot

StyleShot: A SnapShot on Any Style. A model that can transfer any style to any content, generating high-quality, personalized stylized images without per-image fine-tuning!

Python 317 20 Updated Sep 9, 2024

fishaudio / fish-speech

SOTA Open Source TTS

Python 18,865 1,428 Updated Feb 3, 2025

imputnet / cobalt

best way to save what you love

Svelte 27,416 2,206 Updated Feb 6, 2025

yeates / PromptFix

[NeurIPS 24] PromptFix: You Prompt and We Fix the Photo

Python 707 38 Updated Oct 4, 2024

ANYANTUDRE / Florence-2-Vision-Language-Model

Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.

Jupyter Notebook 25 2 Updated Jul 3, 2024

YihanHu-2022 / DiffMatte

Python 86 6 Updated Jul 4, 2024

yeungchenwa / OCR-SAM

Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize, and segment text instances, with several downstream tasks, e.g., Text Removal and Text Inpainting.

Python 550 37 Updated Jan 30, 2024

IDEA-Research / Grounding-DINO-1.5-API

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python 875 31 Updated Jan 21, 2025

Tencent / MimicMotion

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Python 2,152 181 Updated Sep 23, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 8,419 644 Updated Feb 3, 2025

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 37,672 4,610 Updated Feb 6, 2025

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook 3,637 369 Updated Jan 6, 2025

YangLing0818 / IterComp

[ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation

Python 157 10 Updated Feb 1, 2025

shyjal / visual-try-on

A Chrome extension for visually trying on clothing from any e-commerce store.

JavaScript 757 131 Updated Nov 15, 2024

replicate / flux-fine-tuner

Cog wrapper for ostris/ai-toolkit + post-finetuning cog inference for flux models

Python 347 52 Updated Jan 24, 2025

Zheng-Chong / CatVTON

[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) …

Python 1,128 137 Updated Jan 24, 2025

replicate / cog

Containers for machine learning

Python 8,348 577 Updated Feb 5, 2025

xlmnxp / Qocker

Qocker is a user-friendly GUI application for managing Docker containers. Built with PyQt5, it provides an intuitive interface for viewing and interacting with your Docker containers.

Python 205 11 Updated Dec 17, 2024

xdit-project / xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 1,209 101 Updated Jan 24, 2025

bghira / SimpleTuner

A general fine-tuning kit geared toward diffusion models.

Python 2,059 197 Updated Jan 30, 2025

Acly / comfyui-inpaint-nodes

Nodes for better inpainting with ComfyUI: Fooocus inpaint model for SDXL, LaMa, MAT, and various other tools for pre-filling inpaint & outpaint areas.

Python 807 47 Updated Nov 20, 2024