Stars
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
NeuralProphet: A simple forecasting package
The Replica Dataset v1 as published in https://arxiv.org/abs/1906.05797 .
Code for "PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation" CVPR 2019 oral
Dear ImGui: Bloat-free Graphical User interface for C++ with minimal dependencies
Use Vim everywhere you've always wanted to
๐ A collection of pure POSIX sh alternatives to external processes.
antimatter15 / alpaca.cpp
Forked from ggerganov/llama.cppLocally run an Instruction-Tuned Chat-Style LLM
Learning Vim and Vimscript doesn't have to be hard. This is the guide that you're looking for ๐
Learning to Regress 3D Face Shape and Expression from an Image without 3D Supervision
Python tools for 3D face: 3DMM, Mesh processing(transform, camera, light, render), 3D face representations.
Clone a voice in 5 seconds to generate arbitrary speech in real-time
A multi-voice TTS system trained with an emphasis on quality
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
High performance self-hosted photo and video management solution.
A single-header ANSI C immediate mode cross-platform GUI library
A presenter console with multi-monitor support for PDF files.
Offline private voice assistant for many human languages
The ultimate tool to interact with your audience
Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".
Robust Speech Recognition via Large-Scale Weak Supervision
Official implementation of "Video Prediction at Multiple Spatio-Temporal Scales with Hierarchical Recurrent Networks." by Villar-Corrales et al.
Automated, hardware-independent Hand-Eye Calibration