A one stop shop to track all open-access/ source TTS models as they come out. Feel free to make a PR for all those that aren't linked here.
This is aimed as a resource to increase awareness for these models and to make it easier for researchers, developers, and enthusiasts to stay informed about the latest advancements in the field.
Note
This repo will only track open source/access codebase TTS models. More motivation for everyone to open-source! 🤗
Name | GitHub | Weights | License | Fine-tune | Languages | Paper | Demo | Issues |
---|---|---|---|---|---|---|---|---|
Amphion | Repo | 🤗 Hub | MIT | No | Multilingual | Paper | 🤗 Space | |
Bark | Repo | 🤗 Hub | MIT | No | Multilingual | Paper | 🤗 Space | |
EmotiVoice | Repo | GDrive | Apache 2.0 | Yes | ZH + EN | Not Available | Not Available | Separate GUI agreement |
Glow-TTS | Repo | GDrive | MIT | Yes | English | Paper | GH Pages | |
GPT-SoVITS | Repo | 🤗 Hub | MIT | Yes | Multilingual | Not Available | Not Available | |
HierSpeech++ | Repo | GDrive | MIT | No | KR + EN | Paper | 🤗 Space | |
IMS-Toucan | Repo | GH release | Apache 2.0 | Yes | Multilingual | Paper | 🤗 Space | |
MahaTTS | Repo | 🤗 Hub | Apache 2.0 | No | English + Indic | Not Available | Recordings, Colab | |
Matcha-TTS | Repo | GDrive | MIT | Yes | English | Paper | 🤗 Space | GPL-licensed phonemizer |
Neural-HMM TTS | Repo | GitHub | MIT | Yes | English | Paper | GH Pages | |
OpenVoice | Repo | 🤗 Hub | CC-BY-NC 4.0 | No | ZH + EN | Paper | 🤗 Space | Non Commercial |
OverFlow TTS | Repo | GitHub | MIT | Yes | English | Paper | GH Pages | |
pflowTTS | Unofficial Repo | GDrive | MIT | Yes | English | Paper | Not Available | GPL-licensed phonemizer |
Piper | Repo | 🤗 Hub | MIT | Yes | Multilingual | Not Available | Not Available | GPL-licensed phonemizer |
Pheme | Repo | 🤗 Hub | CC-BY | Yes | English | Paper | 🤗 Space | |
RAD-TTS | Repo | GDrive | MIT | Yes | English | Paper | No | |
Silero | Repo | GH links | CC BY-NC-SA | No | EM + DE + ES + EA | Not Available | Not Available | Non Commercial |
StyleTTS 2 | Repo | 🤗 Hub | MIT | Yes | English | Paper | 🤗 Space | GPL-licensed phonemizer |
Tacotron 2 | Unofficial Repo | GDrive | BSD-3 | Yes | English | Paper | Webpage | |
TorToiSe TTS | Repo | 🤗 Hub | Apache 2.0 | Yes | English | Technical report | 🤗 Space | |
TTTS | Repo | 🤗 Hub | MPL 2.0 | No | ZH | Not Available | Colab | |
VALL-E | Unofficial Repo | Not Available | MIT | Yes | NA | Paper | Not Available | |
VITS/ MMS-TTS | Repo | 🤗 Hub / MMS | Apache 2.0 | Yes | English | Paper | 🤗 Space | GPL-licensed phonemizer |
WhisperSpeech | Repo | 🤗 Hub | MIT | No | English, Polish | Not Available | 🤗 Space, Recordings, Colab | |
XTTS | Repo | 🤗 Hub | CPML | Yes | Multilingual | Technical notes | 🤗 Space | Non Commercial |
xVASynth | Repo | GH commit | GPL-3.0 | Yes | Multilingual | Paper | Steam | Copyrighted materials used for training. |
Help make this list more complete. Create demos on the Hugging Face Hub and link them here :) Got any questions? Drop me a DM on Twitter @reach_vb.