Stars
The world's 1st completely free, open-source ID Document Liveness Detection SDK which can detect fake ID cards, Driver Licenses and Passports. Ideal for developers looking for robust, fraud-preventi…
Face recognition and analytics library based on deep neural networks and ONNX runtime
👤🔍 | Face Detection, Gender and Age, Face Recognition, Facial Landmarks
Extraction of machine-readable zone information from passports, visas and id-cards via OCR
Passport document verification using machine learning with Python and scikit-learn
Age Estimation with PyTorch: Deep Learning for Predicting Age
H means Hardware. This PCB is just a prototype to test parallel e-ink signals with the Raspberry Pi Pico
The official Open-Asset-Importer-Library Repository. Loads 40+ 3D-file-formats into one unified and clean data structure.
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
A library for detecting and resolving intersections between two surface meshes.
A procedural geometry generation library for C++11
Easily train a good VC model with voice data <= 10 mins!
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Unified-Modal Speech-Text Pre-Training for Spoken Language Processing
An Open Source text-to-speech system built by inverting Whisper.
A generative speech model for daily dialogue.
A multi-voice TTS system trained with an emphasis on quality
Foundational model for human-like, expressive TTS
🦔 PostHog provides open-source web & product analytics, session recording, feature flagging and A/B testing that you can self-host. Get started - free.
Live, low-latency 2D and 3D tracking from single or multiple high-speed cameras
Learning Locker - The Open Source Learning Record Store. Started in 2014.
[CVPR 2024 Highlight] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Talking Head (3D): A JavaScript class for real-time lip-sync using Ready Player Me full-body 3D avatars.
Industry leading face manipulation platform
Real time background replacement using DeepLabv3 MobileNetv2 model for person segmentation and OpenCV for image processing.
🦜🔗 Build context-aware reasoning applications
[CVPR2023] The implementation for "DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation"