ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

A preliminary study of child vocalization on a parallel corpus of US and shanghainese toddlers

Hynek Bořil, Qian Zhang, Pongtep Angkititrakul, John H. L. Hansen, Dongxin Xu, Jill Gilkerson, Jeffrey A. Richards

This paper studies various aspects of child vocalization as captured in a newly established parallel corpus of sixteen 18.31 months old US and Shanghainese toddlers. The recordings were acquired in 16-hour sessions during an eordinaryf day in the child's natural environment and manually labeled. The vocalization characteristics are studied by means of phonotactic and prosodic analysis with emphasis on automatic processing. In the phonotactic domain, a Gaussian mixture model (GMM) tokenizer, a bank of phone recognizers, and formant tracking are used to analyze the movements in the acoustic-phonetic space. In the prosodic domain, pitch patterns, duration, and rhythm are analyzed. Besides strong individual-specific characteristics of the subjects in some of the domains considered, the two language groups show differences in the occupation of the F1 . F2 formant space, choice of pitch pattern durations, and consistency in producing complex phonetic patterns.


doi: 10.21437/Interspeech.2013-560

Cite as: Bořil, H., Zhang, Q., Angkititrakul, P., Hansen, J.H.L., Xu, D., Gilkerson, J., Richards, J.A. (2013) A preliminary study of child vocalization on a parallel corpus of US and shanghainese toddlers. Proc. Interspeech 2013, 2405-2409, doi: 10.21437/Interspeech.2013-560

@inproceedings{boril13_interspeech,
  author={Hynek Bořil and Qian Zhang and Pongtep Angkititrakul and John H. L. Hansen and Dongxin Xu and Jill Gilkerson and Jeffrey A. Richards},
  title={{A preliminary study of child vocalization on a parallel corpus of US and shanghainese toddlers}},
  year=2013,
  booktitle={Proc. Interspeech 2013},
  pages={2405--2409},
  doi={10.21437/Interspeech.2013-560}
}