ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

TUNDRA: a multilingual corpus of found data for TTS research created with light supervision

Adriana Stan, O. Watts, Y. Mamiya, M. Giurgiu, Robert A. J. Clark, Junichi Yamagishi, Simon King

Simple4All Tundra (version 1.0) is the first release of a standardised multilingual corpus designed for text-to-speech research with imperfect or found data. The corpus consists of approximately 60 hours of speech data from audiobooks in 14 languages, as well as utterance-level alignments obtained with a lightly-supervised process. Future versions of the corpus will include finer-grained alignment and prosodic annotation, all of which will be made freely available. This paper gives a general outline of the data collected so far, as well as a detailed description of how this has been done, emphasizing the minimal language-specific knowledge and manual intervention used to compile the corpus. To demonstrate its potential use, text-to-speech systems have been built for all languages using unsupervised or lightly supervised methods, also briefly presented in the paper.


doi: 10.21437/Interspeech.2013-545

Cite as: Stan, A., Watts, O., Mamiya, Y., Giurgiu, M., Clark, R.A.J., Yamagishi, J., King, S. (2013) TUNDRA: a multilingual corpus of found data for TTS research created with light supervision. Proc. Interspeech 2013, 2331-2335, doi: 10.21437/Interspeech.2013-545

@inproceedings{stan13b_interspeech,
  author={Adriana Stan and O. Watts and Y. Mamiya and M. Giurgiu and Robert A. J. Clark and Junichi Yamagishi and Simon King},
  title={{TUNDRA: a multilingual corpus of found data for TTS research created with light supervision}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={2331--2335},
  doi={10.21437/Interspeech.2013-545}
}