Starred repositories

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".

Python 7,438 519 Updated Dec 27, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 23,583 1,998 Updated Feb 5, 2025

HarmonyOS Next Hap installer

1,239 65 Updated Jan 9, 2025

Workflow-to-APP, ScreenShare & FloatingVideo, GPT & 3D, SpeechRecognition & TTS

JavaScript 1,428 95 Updated Feb 5, 2025

Large language model for ophthalmology consultation

Python 84 17 Updated Jul 16, 2024

Unofficial implementation of InstantID for ComfyUI

Python 1,375 79 Updated May 22, 2024

A tiny C++11 library for reading BVH motion capture data

C++ 40 8 Updated Jul 26, 2021

Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System

Python 8,152 1,985 Updated May 13, 2024

The official repo for [NeurIPS'22] "ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation" and [TPAMI'23] "ViTPose++: Vision Transformer for Generic Body Pose Estimation"

Python 1,481 192 Updated Jul 24, 2024

Official implementations for paper: Zero-shot Image Editing with Reference Imitation

Python 1,191 83 Updated Jun 15, 2024

Open-source, accurate, and easy-to-use video speech recognition and clipping tool with integrated LLM-based AI clipping.

Python 4,101 460 Updated Aug 22, 2024

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 6,149 620 Updated Sep 26, 2024

[CVPR'24] Interactive3D: Create What You Want by Interactive 3D Generation

Python 178 7 Updated Sep 9, 2024

kapture is a file format as well as a set of tools for manipulating datasets, and in particular Visual Localization and Structure from Motion data.

Python 490 66 Updated Mar 11, 2024

DUSt3R: Geometric 3D Vision Made Easy

Python 5,742 622 Updated Sep 20, 2024

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 9,254 867 Updated Dec 10, 2024

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 12,268 2,286 Updated Jun 26, 2024

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Python 10,667 1,089 Updated Jun 21, 2024

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

14,586 982 Updated Jul 26, 2024

Official implementation of DreaMoving

1,798 96 Updated Jan 9, 2024

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 7,624 650 Updated Aug 13, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 75,625 9,040 Updated Jan 4, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 65,886 7,042 Updated Feb 5, 2025

A plugin to add 360 and VR video support to video.js.

JavaScript 549 148 Updated Dec 10, 2024

Text To Video Synthesis Colab

Jupyter Notebook 1,489 181 Updated Mar 28, 2024

Image inpainting tool powered by SOTA AI models. Remove any unwanted object, defect, or person from your pictures, or erase and replace (powered by Stable Diffusion) anything in your pictures.

Python 20,319 2,061 Updated Nov 23, 2024

Cross-platform, customizable ML solutions for live and streaming media.

C++ 28,471 5,235 Updated Feb 5, 2025

This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For the HD commercial model, try Sync Labs.

Python 11,267 2,376 Updated Feb 5, 2025

From comfyui workflow to web app, in seconds

Python 550 71 Updated Mar 21, 2024