SOTA discrete acoustic codec models with 40 tokens per second for audio language modeling
Updated Dec 9, 2024 - Python
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation, etc.
A Survey of Spoken Dialogue Models (60 pages)
SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Unofficial implementation of the paper "High Fidelity Neural Audio Compression"
Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'
A TTS app where you can clone the voices of any person you wish.
A collection of audio codecs with a standardized API
Experiments sonifying frame-level encodecmae features and encodecmae summary vectors using generative audio models.
Code implementation of "High Fidelity Neural Audio Compression", Meta AI's Encodec paper