Stars
[ICCV'21] Learning Spatio-Temporal Transformer for Visual Tracking
This is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.
😎 A list of awesome scene understanding papers.
MiniSora: A community aims to explore the implementation path and future development direction of Sora.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
[CVPR 2024] DiffuScene: Denoising Diffusion Models for Generative Indoor Scene Synthesis
Network estimating 3D Handpose from single color images
3D Hand Shape and Pose Estimation from a Single RGB Image
Collection of awesome resources on image-to-image translation.
[ECCV 2022] Official implementation of the paper "DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation"
Image augmentation for machine learning experiments.
Nightly release of ControlNet 1.1
Unpaired Image-to-Image Translation with Shortest Path Regularization
This is an official pytorch implementation of Lite-HRNet: A Lightweight High-Resolution Network.
This is an official implementation of our CVPR 2020 paper "HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation" (https://arxiv.org/abs/1908.10357)
[BMVC 2020 Oral] Bipartite Graph Reasoning GANs for Person Image Generation
Code & Data for Enhancing Photorealism Enhancement
DomainBed is a suite to test domain generalization algorithms
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习
UmeTrack Unified multi-view end-to-end hand tracking for VR
Automatic 3D Character animation using Pose Estimation and Landmark Generation techniques
Awesome work on hand pose estimation/tracking
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …
🏘️ Scaling Embodied AI by Procedurally Generating Interactive 3D Houses
Infinite Photorealistic Worlds using Procedural Generation
[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer