Name		Name	Last commit message	Last commit date
parent directory ..
docs		docs
evaluation		evaluation
preprocessing		preprocessing
README.md		README.md
__init__.py		__init__.py
data_utils.py		data_utils.py
generate_waveform.py		generate_waveform.py
utils.py		utils.py

README.md

Speech Synthesis (S^2)

https://arxiv.org/abs/2109.06912

Speech synthesis with fairseq.

Features

Autoregressive and non-autoregressive models
Multi-speaker synthesis
Audio preprocessing (denoising, VAD, etc.) for less curated data
Automatic metrics for model development
Similar data configuration as S2T

Examples

Citation

Please cite as:

@article{wang2021fairseqs2,
  title={fairseq S\^{} 2: A Scalable and Integrable Speech Synthesis Toolkit},
  author={Wang, Changhan and Hsu, Wei-Ning and Adi, Yossi and Polyak, Adam and Lee, Ann and Chen, Peng-Jen and Gu, Jiatao and Pino, Juan},
  journal={arXiv preprint arXiv:2109.06912},
  year={2021}
}

@inproceedings{ott2019fairseq,
  title = {fairseq: A Fast, Extensible Toolkit for Sequence Modeling},
  author = {Myle Ott and Sergey Edunov and Alexei Baevski and Angela Fan and Sam Gross and Nathan Ng and David Grangier and Michael Auli},
  booktitle = {Proceedings of NAACL-HLT 2019: Demonstrations},
  year = {2019},
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech_synthesis

speech_synthesis

README.md

Speech Synthesis (S^2)

Features

Examples

Citation

Files

speech_synthesis

Directory actions

More options

Directory actions

More options

Latest commit

History

speech_synthesis

Folders and files

parent directory

README.md

Speech Synthesis (S^2)

Features

Examples

Citation