Stars
AI-based React component library that detects clapping sounds or finger snaps. Using a TensorFlow.js-based machine learning model, it accurately analyzes sounds in real-time.
Samples and Tools for Windows ML.
A problem solver based on an agentic workflow.
Papers, Datasets, Benchmarks for 3D Face (Reconstruction, Talking head, etc.)
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Fine-Grained Open Domain Image Animation with Motion Guidance
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Kimwoonggon - Cpp Libtorch Dll with GPU Version of YOLOv8 Seg and inference in C#
4th Place Solution for BirdCLEF 2023 Identify bird calls in soundscapes
KoLLaVA: Korean Large Language-and-Vision Assistant (feat. LLaVA)
Model to check whether an image was rotated by 90, 180, or 270 degrees.
Visual localization made easy with hloc
The original sources of MS-DOS 1.25, 2.0, and 4.0 for reference purposes
[CVPRW 2022] Delving into High-Quality Synthetic Face Occlusion Segmentation Datasets
The implementation of the technical report: "Customized Segment Anything Model for Medical Image Segmentation"
[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
This repository is for the first comprehensive survey on Meta AI's Segment Anything Model (SAM).
Implementation of Rotary Embeddings, from the RoFormer paper, in PyTorch
PyTorch code for "EleGANt: Exquisite and Locally Editable GAN for Makeup Transfer" (ECCV 2022)