Stars
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Code for the AAAI 2018 publication "SEE: Towards Semi-Supervised End-to-End Scene Text Recognition"
Automatic differentiation with weighted finite-state transducers.
Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"
Facebook AI Research's Automatic Speech Recognition Toolkit
A C++ standalone library for machine learning
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.