This paper proposes a method for automatically clustering multilingual speech so as to derive language family trees. We consider that the language is the source of information which generates speech feature parameters; the probability or statistical characteristics of this information is modeled by Gaussian mixture models (GMMs); then a distance measure between the GMMs is introduced. Based on this, we construct family trees of multilingual speech which are quite similar to those considered in linguistics.
Cite as: Itahashi, S., Zhu, S., Yamamoto, M. (2005) Constructing family trees of multilingual speech using Gaussian mixture models. Proc. Interspeech 2005, 2253-2256, doi: 10.21437/Interspeech.2005-712
@inproceedings{itahashi05_interspeech, author={Shuichi Itahashi and Shiwei Zhu and Mikio Yamamoto}, title={{Constructing family trees of multilingual speech using Gaussian mixture models}}, year=2005, booktitle={Proc. Interspeech 2005}, pages={2253--2256}, doi={10.21437/Interspeech.2005-712} }