An Automatically Aligned Corpus of Child-Directed Speech

Micha Elsner, Kiwako Ito


Forced alignment would enable phonetic analyses of child-directed speech (CDS) corpora that already have transcriptions. However, existing alignment systems are inaccurate on CDS because of its atypical phonetics. We adapt a Kaldi forced alignment system to CDS by extending the dictionary and providing it with heuristically derived hints for vowel locations. Using this system, we present a new time-aligned CDS corpus with a million aligned segments. We manually correct a subset of the corpus and demonstrate that our system is 70% accurate. Both our automatic and manually corrected alignments are publicly available at osf.io/ke44q.


DOI: 10.21437/Interspeech.2017-379

Cite as: Elsner, M., Ito, K. (2017) An Automatically Aligned Corpus of Child-Directed Speech. Proc. Interspeech 2017, 1736-1740, DOI: 10.21437/Interspeech.2017-379.


@inproceedings{Elsner2017,
  author={Micha Elsner and Kiwako Ito},
  title={An Automatically Aligned Corpus of Child-Directed Speech},
  year=2017,
  booktitle={Proc. Interspeech 2017},
  pages={1736--1740},
  doi={10.21437/Interspeech.2017-379},
  url={http://dx.doi.org/10.21437/Interspeech.2017-379}
}