Stars
Official implementation of "DepthLab: From Partial to Complete"
Gourieff / sd-webui-reactor
Forked from s0md3v/sd-webui-roopFast and Simple Face Swap Extension for StableDiffusion WebUI (A1111 SD WebUI, SD WebUI Forge, SD.Next, Cagliostro)
Industry leading face manipulation platform
A generative world for general-purpose robotics & embodied AI learning.
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
DepthSplat: Connecting Gaussian Splatting and Depth
A collection of awesome video generation studies.
✨✨Latest Advances on Multimodal Large Language Models
3DGS-based change detection for physical object rearrangement
Official code for paper: Synergizing Motion and Appearance: Multi-Scale Compensatory Codebooks for Talking Head Video Generation
(CVPR 2024) Official code for paper "Towards Language-Driven Video Inpainting via Multimodal Large Language Models"
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
A Point Transformer with Federated Learning for HER2 Status Prediction
🚀 [ARXIV 2024] Pytorch implementation of 'Fast Feedforward 3D Gaussian Splatting Compression'
Surf-D: Generating High-Quality Surfaces of Arbitrary Topologies Using Diffusion Models (ECCV 2024)
Layout-Guided multi-view driving scene video generation with latent diffusion model
The official release of JointDreamer (ECCV24 poster)
Official codes for paper: 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection
DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling
[ECCV24] Official code for RoomTex: Texturing Compositional Indoor Scenes via Iterative Inpainting
This is the official repo of the paper "Latent Guard: a Safety Framework for Text-to-image Generation"