-
Institute of Automation, Chinese Academy of Sciences
- Beijing, China
- https://wang-zidu.github.io/
- https://scholar.google.com/citations?user=7zD5f0IAAAAJ&hl=zh-CN&oi=sra
Stars
SynShot - Synthetic Prior for Few-Shot Drivable Head Avatar Inversion [CVPR 2025]
Code for 3D-LLM: Injecting the 3D World into Large Language Models
[CVPR'25] Official Implementation of MambaIC: State Space Models for High-Performance Learned Image Compression
ETCH: Generalizing Body Fitting to Clothed Humans via Equivariant Tightness
[CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"
Official repo of "StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors“
[ICLR 2025] Official implementation of "Perm: A Parametric Representation for Multi-Style 3D Hair Modeling"
Official implementation for the SIGGRAPH Asia 2024 paper SPARK: Self-supervised Personalized Real-time Monocular Face Capture
Implicit Shape and Appearance Priors for Few-Shot Full Head Reconstruction (SIRA++)
[CVPR2025] Progressive Rendering Distillation: Adapting Stable Diffusion for Instant Text-to-Mesh Generation without 3D Data
Latest Advances on System-2 Reasoning
ChartCoder: Advancing Multimodal Large Language Model for Chart-to-Code Generation
Blender Python PLY importer for point clouds and nonstandard models.
[ICLR 2025] Official Implementation of Local-Prompt: Extensible Local Prompts for Few-Shot Out-of-Distribution Detection
This repository will host the code for the SIGGRAPH Asia 2024 Paper titled: "GaussianHeads: End-to-End Learning of Drivable Gaussian Head Avatars from Coarse-to-fine Representations"
This is the official code of AAAI 2025: RetouchGPT: LLM-based Interactive High-Fidelity Face Retouching via Imperfection Prompting.
[CVPR 2024] An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation
Official implementation for MVBoost: Boost 3D Reconstruction with Multi-View Refinement
Code for SIGGRAPH2024 paper "ContourCraft: Learning to Resolve Intersections in Neural Multi-Garment Simulations"
The official implementation of "MeGA: Hybrid Mesh-Gaussian Head Avatar for High-Fidelity Rendering and Head Editing".
Official implementation for "UniST: A Prompt-Empowered Universal Model for Urban Spatio-Temporal Prediction" (KDD 2024)
The official repository of our CVPR2023 paper "FFHQ-UV: Normalized Facial UV-Texture Dataset for 3D Face Reconstruction".
This repository contains the source code for the paper First Order Motion Model for Image Animation
Offical code of TECA: Text-Guided Generation and Editing of Compositional 3D Avatars