Lists (1)
Sort Name ascending (A-Z)
Starred repositories
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
A generative world for general-purpose robotics & embodied AI learning.
Workflow-to-APP、ScreenShare&FloatingVideo、GPT & 3D、SpeechRecognition&TTS
Unofficial implementation of InstantID for ComfyUI
A tiny C++11 library for reading BVH motion capture data
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
[CVPR'24] Interactive3D: Create What You Want by Interactive 3D Generation
kapture is a file format as well as a set of tools for manipulating datasets, and in particular Visual Localization and Structure from Motion data.
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Robust Speech Recognition via Large-Scale Weak Supervision
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
A plugin to add 360 and VR video support to video.js.
Text To Video Synthesis Colab
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
Cross-platform, customizable ML solutions for live and streaming media.
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
From comfyui workflow to web app, in seconds