Skip to content
View Gymat's full-sized avatar

Block or report Gymat

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

CatV2TON is a lightweight DiT-based visual virtual try-on model, capable of supporting try-on for both images and videos.

Python 114 6 Updated Feb 24, 2025

🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 486 15 Updated Apr 12, 2025

OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting

171 7 Updated Mar 31, 2025

Code and dataset for "Detecting Human Artifacts from Text-to-Image Models"

Python 20 Updated Dec 26, 2024
Python 948 58 Updated Mar 22, 2025

HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo

Python 1,299 103 Updated Mar 28, 2025

[ICLR 2025] CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) …

Python 1,300 154 Updated Feb 24, 2025

A powerful tool that translates ComfyUI workflows into executable Python code.

Python 1,699 159 Updated Jan 14, 2025

StoryMaker: Towards consistent characters in text-to-image generation

Python 686 58 Updated Dec 2, 2024

SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization

Python 129 13 Updated Mar 18, 2025

HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation

Python 54 3 Updated Feb 18, 2025

SkyReels V1: The first and most advanced open-source human-centric video foundation model

Python 1,984 190 Updated Mar 10, 2025

AcadHomepage: A Modern and Responsive Academic Personal Homepage

SCSS 1,862 3,657 Updated Apr 11, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 8,399 700 Updated Apr 2, 2025

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Python 215 6 Updated Mar 26, 2025

Various AI scripts. Mostly Stable Diffusion stuff.

Python 4,498 504 Updated Apr 12, 2025
Python 84 5 Updated Nov 27, 2024

A Video Tokenizer Evaluation Dataset

Python 111 8 Updated Jan 13, 2025

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

Jupyter Notebook 7,904 507 Updated Apr 2, 2025

Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.

Python 29,286 3,477 Updated Apr 13, 2025

Official code of SmartEdit [CVPR-2024 Highlight]

Python 320 11 Updated Jun 21, 2024

A collection of vision foundation models unifying understanding and generation.

48 2 Updated Jan 2, 2025

Let's finetune video generation models!

Python 441 21 Updated Apr 11, 2025

DeepFashion2 Dataset https://arxiv.org/pdf/1901.07973.pdf

Jupyter Notebook 2,400 370 Updated Jan 28, 2025

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 3,927 337 Updated Feb 20, 2025

Educational implementation of the Discrete Flow Matching paper

Jupyter Notebook 83 6 Updated Aug 26, 2024

[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 306 1 Updated Mar 5, 2025

A minimal and universal controller for FLUX.1.

Python 1,440 99 Updated Apr 12, 2025
Next