ISCA Archive Interspeech 2005
ISCA Archive Interspeech 2005

A study on separation between acoustic models and its applications

Yu Tsao, Jinyu Li, Chin-Hui Lee

We study separation between models of speech attributes. A good measure of separation usually serves as a key indicator of the discrimination power of these speech models because it can often be used to indirectly determine the performance of speech recognition and verification systems. In this study, we use a probabilistic distance, called generalized log likelihood ratio (GLLR), to measure the separation between a model of a target speech attribute and models of its competing attributes. We illustrate five applications to compare separations among models obtained over multiple levels of discrimination capabilities, at various degrees of acoustic definitions and resolutions, under mismatched training and testing conditions, and with different training criteria and speech parameters. We demonstrate that the well-known GLLR distance and its corresponding histograms also provide a good utility to qualitatively and quantitatively characterize the properties of trained models without performing large scale speech recognition and verification experiments.

doi: 10.21437/Interspeech.2005-124

Cite as: Tsao, Y., Li, J., Lee, C.-H. (2005) A study on separation between acoustic models and its applications. Proc. Interspeech 2005, 1109-1112, doi: 10.21437/Interspeech.2005-124

  author={Yu Tsao and Jinyu Li and Chin-Hui Lee},
  title={{A study on separation between acoustic models and its applications}},
  booktitle={Proc. Interspeech 2005},