Stars
OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.
The source code of "DINet: deformation inpainting network for realistic face visually dubbing on high resolution video."
A set of nodes for ComfyUI that can composite layer and mask to achieve Photoshop like functionality.
LoRA & Dreambooth training scripts & GUI use kohya-ss's trainer, for diffusion model.
Learning Compressed Representation of 3DLUT for Image-enhancement. Higher performance with much smaller models!
AI based multi-label girl image classification system, implemented by using TensorFlow.
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
so-vits-svc fork with realtime support, improved interface and more features.
[CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.
Code for Cvpr2021 "CT-Net: Complementary Transfering Network for Garment Transfer with Arbitrary Geometric Changes"
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。
Code for MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
Create 🔥 videos with Stable Diffusion by exploring the latent space and morphing between text prompts
Official tensorflow implementation for CVPR2020 paper “Learning to Cartoonize Using White-box Cartoon Representations”
Cartoonize images using traditional computer vision, then train a GAN to do it
Official implementation of "DCT-Net: Domain-Calibrated Translation for Portrait Stylization", SIGGRAPH 2022 (TOG); Multi-style cartoonization