COSMO SylPhon: A Bayesian Perceptuo-motor Model to Assess Phonological Learning

Marie-Lou Barnaud, Juien Diard, Pierre Bessière, Jean-Luc Schwartz


During speech development, babies learn to perceive and produce speech units, especially syllables and phonemes. However, the mechanisms underlying the acquisition of speech units still remain unclear. We propose a Bayesian model of speech communication, named “COSMO SylPhon”, for studying the acquisition of both syllables and phonemes. In this model, speech development involves a sensory learning phase mainly related to perception development and a motor learning phase mainly related to production development. We analyze how an agent can learn speech units during these two phases through an unsupervised learning process based on syllable stimuli. We show that the learning process enables to efficiently learn the distribution of syllabic stimuli provided in the environment. Importantly, we show that if agents are equipped with a bootstrap process inspired by the Frame-Content Theory of speech development, they learn to associate consonants to specific articulatory gestures, providing the basis for consonantal articulatory invariance.


 DOI: 10.21437/Interspeech.2018-73

Cite as: Barnaud, M., Diard, J., Bessière, P., Schwartz, J. (2018) COSMO SylPhon: A Bayesian Perceptuo-motor Model to Assess Phonological Learning. Proc. Interspeech 2018, 3786-3790, DOI: 10.21437/Interspeech.2018-73.


@inproceedings{Barnaud2018,
  author={Marie-Lou Barnaud and Juien Diard and Pierre Bessière and Jean-Luc Schwartz},
  title={COSMO SylPhon: A Bayesian Perceptuo-motor Model to Assess Phonological Learning},
  year=2018,
  booktitle={Proc. Interspeech 2018},
  pages={3786--3790},
  doi={10.21437/Interspeech.2018-73},
  url={http://dx.doi.org/10.21437/Interspeech.2018-73}
}