-
Video Coding Laboratory, Peking University
- https://lotayou.github.io
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
🎁 6,500,000+ Unsplash images made available for research and machine learning
https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching
Official Implementation for "Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation" (CVPR 2021) presenting the pixel2style2pixel (pSp) framework
[ICCV 2023] StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention
Illumination Drawing Tools for Text-to-Image Diffusion Models
[CVPR 2025 Highlight🔥] Identity-Preserving Text-to-Video Generation by Frequency Decomposition
InstantID-ROME: Improved Identity-Preserving Generation in Seconds 🔥
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Custom nodes pack for ComfyUI This custom node helps to conveniently enhance images through Detector, Detailer, Upscaler, Pipe, and more.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which aims to generate realistic composite image.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Python tools for 3D face: 3DMM, Mesh processing(transform, camera, light, render), 3D face representations.
Pytorch Implementation of: "Stable-Hair: Real-World Hair Transfer via Diffusion Model" (AAAI 2025)
[CVPR 2024 Highlight] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"
A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]
Enjoy the magic of Diffusion models!
[ICLR 2025] Official implementation of MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance
Official implementations for paper: Anydoor: zero-shot object-level image customization
High-Resolution Image Synthesis with Latent Diffusion Models
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Unofficial PyTorch Implementation for HifiFace (https://arxiv.org/abs/2106.09965)
A fast poisson image editing implementation that can utilize multi-core CPU or GPU to handle a high-resolution image input.
Official PyTorch implementation of ECCV 2024 Paper: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback.
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
[ECCV 2024] codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior