INTERSPEECH 2008
9th Annual Conference of the International Speech Communication Association

Brisbane, Australia
September 22-26, 2008

The Acoustic to Articulation Mapping: Non-Linear or Non-Unique?

Daniel Neiberg, G. Ananthakrishnan, Olov Engwall

KTH, Sweden

This paper studies the hypothesis that the acoustic-to-articulatory mapping is non-unique, statistically. The distributions of the acoustic and articulatory spaces are obtained by fitting the data into a Gaussian Mixture Model. The kurtosis is used to measure the non-Gaussianity of the distributions and the Bhattacharya distance is used to find the difference between distributions of the acoustic vectors producing non-unique articulator configurations. It is found that stop consonants and alveolar fricatives are generally not only non-linear but also non-unique, while dental fricatives are found to be highly non-linear but fairly unique. Two more investigations are also discussed: the first is on how well the best possible piecewise linear regression is likely to perform, the second is on whether the dynamic constraints improve the ability to predict different articulatory regions corresponding to the same region in the acoustic space.

Full Paper

Bibliographic reference.  Neiberg, Daniel / Ananthakrishnan, G. / Engwall, Olov (2008): "The acoustic to articulation mapping: non-linear or non-unique?", In INTERSPEECH-2008, 1485-1488.