Lists (7)
Sort Name ascending (A-Z)
Starred repositories
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
A ComfyUI node for driving videos using batches of images.
LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models
[ACM MM 2021 Best Paper Award] Video Background Music Generation with Controllable Music Transformer
Official PyTorch implementation of the “Spatial-Semantic Collaborative Cropping for User Generated Content”. (AAAI24)
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
[ACMMM 2023, Official Code] for paper "EAT: An Enhancer for Aesthetics-Oriented Transformers". Official Weights and Demos provided. 目前是地表最强开源美学评估模型之一.
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Paint by Inpaint: Learning to Add Image Objects by Removing Them First
collection of diffusion model papers categorized by their subareas
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Official repo for Artist: Aesthetically Controllable Text-Driven Stylization without Training
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Paper 'Transformer based Pluralistic Image Completion with Reduced Information Loss' in TPAMI 2024 and 'Reduce Information Loss in Transformers for Pluralistic Image Inpainting' in CVPR2022
Official PyTorch Code and Models of "Minutes to Seconds: Speeded-up DDPM-based Image Inpainting with Coarse-to-Fine Sampling", ICME 2024
A Collection of Papers and Codes for CVPR2024/CVPR2021/CVPR2020 Low Level Vision
A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC
SEED-Story: Multimodal Long Story Generation with Large Language Model