Skip to content
View carpedkm's full-sized avatar

Highlights

  • Pro

Block or report carpedkm

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,821 3,455 Updated May 18, 2024

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 1,717 67 Updated Jan 2, 2025

A minimal and universal controller for FLUX.1.

Python 1,044 65 Updated Jan 7, 2025

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".

Python 6,204 405 Updated Dec 27, 2024

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,196 952 Updated Jan 8, 2025

OmniControl: Control Any Joint at Any Time for Human Motion Generation, ICLR 2024

Python 279 19 Updated Jun 14, 2024

[NeurIPS 2024] Official code for "Splatter a Video: Video Gaussian Representation for Versatile Processing"

115 Updated Nov 15, 2024

[SIGGRAPH Asia 2024] ReVersion: Diffusion-Based Relation Inversion from Images

Python 497 19 Updated Dec 1, 2024

CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient

Python 75 1 Updated Nov 28, 2024

Simple, unified interface to multiple Generative AI providers

Python 9,638 868 Updated Jan 5, 2025

Unofficial Implementation of E-LatentLPIPS(Ensembled-LatentLPIPS) of Diffusion2GAN

Python 40 2 Updated Jul 11, 2024

PFGuard: A Generative Framework with Privacy and Fairness Safeguards

Python 16 Updated Dec 3, 2024

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

359 21 Updated Dec 18, 2024
Python 349 26 Updated Nov 4, 2024
Python 12 1 Updated Sep 28, 2023

A curated list of image inpainting and video inpainting papers and resources

Python 1,955 263 Updated Nov 6, 2024

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,729 85 Updated Oct 31, 2024

[T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection

Python 35 1 Updated Aug 29, 2023

GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm

C 8,437 302 Updated Dec 30, 2024

Tool to display AMDGPU usage

Rust 833 18 Updated Jan 5, 2025

MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision

Python 653 35 Updated Dec 8, 2024

Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory

Python 20,154 1,429 Updated Jan 9, 2025

Next-Token Prediction is All You Need

Python 1,955 77 Updated Oct 24, 2024

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 3,943 278 Updated Oct 5, 2024

ISR-DPO:Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO

Python 12 1 Updated Jan 6, 2025

ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback

Python 56 3 Updated Sep 12, 2024

✨✨ MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?

Python 88 6 Updated Nov 22, 2024

[WACV 2024] Training-Free Layout Control with Cross-Attention Guidance

Python 244 15 Updated Mar 18, 2024

HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models

Python 301 17 Updated Mar 14, 2024
Next