Lists (1)
Sort Name ascending (A-Z)
Stars
Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling".
[CVPR 2025] Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
LBM: Latent Bridge Matching for Fast Image-to-Image Translation ✨
Enhance-A-Video: Better Generated Video for Free
[CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
[CVPR 2025] Official repo for ART:Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
SpargeAttention: A training-free sparse attention that can accelerate any model inference.
Minimalistic large language model 3D-parallelism training
Original reference implementation of "EVER: Exact Volumetric Ellipsoid Rendering for Real-time View Synthesis"
Inference-time scaling of diffusion-based image and video generation models.
Code release for "LLMs can see and hear without any training"
An open-source implementation of Regional Adaptive Sampling (RAS), a novel diffusion model sampling strategy that introduces regional variability in sampling steps
Video Generation Foundation Models: https://saiyan-world.github.io/goku/
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
Official code of "LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer"
Simple go utility to download HuggingFace Models and Datasets
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching
Rembg is a tool to remove images background
Implementation of "Multimodal Color Recommendation for Vector Graphic Documents" ACM MM'23
[CVPR 2024 Oral] Official repository for RALF: Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation
[CVPR 2024] Official implementation of "SuperNormal: Neural Surface Reconstruction via Multi-View Normal Integration"
Python binding to Skia Graphics Library
Official PyTorch implementation for TOMM24 "SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection"
Official PyTorch implementation for TCSVT 23 "Detect Any Shadow: Segment Anything for Video Shadow Detection"